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Preface 



This proceedings volume includes papers for presentation at DGCI 2000 in Upp- 
sala, Sweden, 13-15 December 2000. DGCI 2000 is the 9th in a series of interna- 
tional conferences on Discrete Geometry for Computer Imagery. DGCI 2000 is 
organized by the Centre for Image Analysis (Swedish University of Agricultural 
Sciences and Uppsala University). It is sponsored by the International Associa- 
tion for Pattern Recognition (lAPR) and by the Swedish Society for Automated 
Image Analysis (SSAB). This is the first DGCI conference to take place outside 
France. 

The number of researchers active in the field of discrete geometry and com- 
puter imagery is increasing. In comparison with the previous editions, DGCI 2000 
has attracted a significantly larger number of research contributions from aca- 
demic and research institutions in different countries. In fact, 62 papers were sub- 
mitted. The contributions focus on topology, discrete images, surfaces and vol- 
umes, shape representation, and shape understanding. After careful reviewing by 
an international board of reviewers, 40 papers from 16 countries were accepted, 
leading to a high-quality conference. Of the 40 accepted papers, 28 were selected 
for oral presentation and 12 for poster presentation. All accepted contributions 
have been scheduled in plenary sessions. The programme also has been enriched 
by three invited lectures, presented by three internationally known speakers: 
Pieter Jonker (Delft University of Technology, The Netherlands), Pierre Soille 
(ISPRA, Italy), and Jayaram K. Udupa (University of Pennsylvania, USA). 

We hope that we have made DGGI 2000 an unforgettable event, especially for 
researchers from outside Sweden. In fact, the uniquely Swedish Lucia celebration 
of returning light in the middle of winter has been scheduled in the early morning 
of 13 December, just before starting the DGGI technical programme. Also, the 
conference dates were chosen to overlap with the week when the Nobel Prizes are 
awarded in Stockholm and the Nobel Laureates are giving lectures in Uppsala. 
In our opinion, these events should be very stimulating for all researchers. 

Hereby, we would like to thank the contributors who responded to the call 
for papers, the invited speakers, all reviewers and members of the Steering, 
Programme, and Local Gommittees, as well as DGGI participants. We are also 
grateful to the sponsoring organizations that provided financial help, indispens- 
able to a successful conference. 
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Abstract. The main contribution of this paper is a new “extrinsic” dig- 
ital fundamental group that can be readily generalized to define higher 
homotopy groups for arbitrary digital spaces. We show that the digital 
fundamental group of a digital object is naturally isomorphic to the fun- 
damental group of its continuous analogue. In addition, we state a digital 
version of the Seifert-Van Kampen theorem. 

Keywords: Digital homotopy, digital fundamental group, lighting func- 
tions, Seifert-Van Kampen theorem. 



1 Introduction 

Thinning is an important pre-processing operation in pattern recognition whose 
goal is to reduce a digital image into a “topologically equivalent skeleton”. In 
particular, thinning algorithms must preserve “tunnels” when processing three- 
dimensional digital images. As it was pointed out in [4], this requirement can 
be correctly established by means of an appropriate digital counterpart of the 
classical fundamental group in algebraic topology; see [11]. 

The first notion of a digital fundamental group (and even of higher homo- 
topy groups) is dued to Khalimsky ([3]). He gave an “extrinsic” definition of this 
notion for a special class of digital spaces based on a topology on the set 7Z ^ , for 
every positive integer n. However, this approach is not suitable for other kinds 
of digital spaces often used in image processing, as the (a, /3)-connected spaces, 
where {a, f3) e {(4, 8), (8, 4)} if n = 2 and {a, f3) e {(6, 26), (26, 6), (6, 18), (18, 6)} 
if n = 3. Within the graph-theoretical approach to Digital Topology, Kong solved 
partially this problem in [4] by defining an “intrinsic” digital fundamental group 
for the class of strongly normal digital picture spaces (SN-DPS), which include as 
particular cases both the (a, /3)-connected spaces and the 2- and 3-dimensional 
Khalimsky’s spaces. Nevertheless, Kong’s definition seems not be general enough 
to give higher homotopy groups. 

* This work has been partially supported by the projects DGICYT PB96-1374 and 
DGES PB96-0098G04-01. 
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The goal of this paper is to introduce, via the framework of the multilevel 
architecture for Digital Topology in [2], a new notion of digital fundamental 
group (denoted by irf) that, at least from a theoretical point of view, presents 
certain advantages over the notions of Khalimsky and Kong. Firstly, the group 
irf is defined by using an “extrinsic” setting that can be readily generalized 
to define higher digital homotopy groups (see Section 3). Secondly, this group 
is available on a larger class of digital spaces than Khalimsky ’s and Kong’s 
digital fundamental groups, since the digital spaces described in the multilevel 
architecture quoted above include as examples all Khalimsky ’s spaces and the 
(<a, /3)-connected spaces (see [1]), and even most of the SN-DPS. And finally, 
our digital fundamental group of a digital object O turns out to be naturally 
isomorphic to the fundamental group of its continuous analogue; that is, of the 
continuous object perceived when one looks at O (see Section 4). This isomor- 
phism shows that the group nf is an appropriate counterpart of the ordinary 
fundamental group in continuous topology. In particular, this fact leads us to 
obtain a (restricted) digital version of the Seifert-Van Kampen Theorem (see 
Section 5). Although this theorem provides a powerful theoretical tool to obtain 
the group nf for certain digital objects, it remains as an open question to find 
an algorithm that computes this group for arbitrary objects; that is, to resem- 
ble in our framework the well-known algorithm for the fundamental group of 
polyhedra ([8]). This problem could be tackled by adapting to our multilevel 
architecture the algorithm recently developed by Malgouyres in [7] , which com- 
putes a presentation of the digital fundamental group of an object embedded in 
an arbitrary graph. 

2 The Multilevel Architecture 

In this section we briefly summarize the basic notions of the multilevel archi- 
tecture for digital topology developed in [2] as well as the notation that will be 
used through all the paper. 

In that architecture, the spatial layout of pixels in a digital image is repre- 
sented by a device models which is a homogeneously n-dimensional locally finite 
polyhedral complex K. Each n-cell in K is representing a pixel, and so the digital 
object displayed in a digital image is a subset of the set celln(AT) of n-cells in K. 
A digital space is a pair (iT, /), where AT is a device model and / is weak lighting 
function defined on K. The function / is used to provide a continuous interpre- 
tation, called continuous analogue^ for each digital object O C celln(AT). Next 
we recall the notion of weak lighting function. For this we need the following 
notation. 

Let AT be a device model and 7, a cells in AT. We shall write 7 < cr if 7 
is a face of cr, and 7 < cr if in addition 7 7^ cr. If | AT | denotes the underlying 
polyhedron of AT, a centroid-map is a map c \ K ^ \ K\ such that c(cr) belongs 
to the interior of cr; that is, c(cr) G cr — 5cr, where da = Ujy; 7 < cr} stands for 
the boundary of a. Given a cell a e K and a digital object O C celln(AT), the 
star of a in O and the extended star of a in O are respectively the digital objects 
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st^(o^; O) = {a e O; a < a} and st* ((a; O) = {cr G O; o; H cr 7 ^ 0}. The support 
ofO^ supp(O), is the set of all cells a e K such that a = n{cr; a G stn{a; O)}. To 
ease the writing, we shall use the following notation: supp(iT) = supp(celln(i^)), 
stn(<a;i^) = stn(<a; celln(i^)) and st*((a;7T) = st* (a; celln(i^)). Finally, we shall 
write V{A) for the family of all subsets of a given set A. 

Given a device model K, a weak lighting function (w.l.f.) on iT is a map 

/ : V{ce\\n{K)) X K {0, 1} satisfying the following five properties for all 

O G V{ce\\n{K)) and a e K: 

1. if q; G O then /(O, a) = 1; 

2 . \i a ^ supp(O) then f{0,a) = 0 ; 

3 . f{0,a) < f{ce\ln(K),a); 

4. f{0,a) = f{stl{a;0),a); and, 

5 . if O' C O C celln(i^) and a ^ K are such that stn(<a;0) = stn(<a;0'), 

f{0'^a) = 0 and /(O,o;) = 1, then the set of cells o;(0';0) = {[3 < 

a; f{0',f3) = 0, f{0,j3) = 1} is not empty and connected in da] moreover, 
if O C O C celln(i^), then f{0,/3) = 1 for every j3 G a{0'; O). 

A w.l.f. / is said to be strongly local if /(O,o;) = /(stn(<a; O), o;) for all a G AT 
and O C celln(i^). Notice that this strong local condition implies both properties 
4 and 5 in the definition above. 

To define the continuous analogue of a given digital object O in a digital 
space (AT, /), we need to introduce several other intermediate models (the levels 
of this multilevel architecture) as follows. 

The device level of O is the subcomplex K(0) = {a ^K]a<a^a^ O} 
of K induced by the cells in O. Notice that the map fo given by fo(0',a) = 
/(O, a)/((9', <a), for all O' C O and a G K{0)^ is a w.l.f. on K{0)^ and we call 
the pair (AT(0),/o) the digital subspace of (K^f) induced by O. 

The logical level of O is an undirected graph, whose vertices are the 
centroids of n-cells in O and two of them c(cr), c(r) are adjacent if there exists 
a common face a < cr fl r such that f{0,a) = 1 . 

The conceptual level of O is the directed graph Cq whose vertices are the 
centroids c{a) of all cells a ^ K with f{0,a) = 1, and its directed edges are 
(c((a), c(/5)) with a < (3. 

The simplicial analogue of O is the order complex Aq associated to the 
digraph Cq. That is, (xq, xi, . . . , Xm) is an m-simplex of Aq if xq, xi, . . . , Xm is 
a directed path in Cq . This simplicial complex defines the simplicial level for the 
object O in the architecture and, finally, the continuous analogue of O is the 
underlying polyhedron | Aq \ of Aq . 

For the sake of simplicity, we will usually drop “/” from the notation of the 
levels of an object. Moreover, for the whole object celln(AT) we will simply write 
Ck^ Ck and Ak for its levels. 

Example 1. In this paper it will be essential the role played by the archetypical 
device model ^ termed the standard cubical decomposition of the Euclidean n- 
space IR'^. Recall that the device model is the complex determined by the 
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collection of unit n-cubes in whose edges are parallel to the coordinate axes 
and whose centers are in the set . The centroid-map we will consider in 
associates to each cube a its barycenter c{a). In particular, if dimer = n then 
c(cr) G where dimer stands for the dimension of er. So that, every digital 
object O in can be identified with a subset of points in Henceforth 
we shall use this identification without further comment. In particular, we shall 
consider the family of digital spaces (R'^^g), for every positive integer n, where g 
is the w.l.f. given by g{0^a) = 1 if and only if stn{c^; R^) C O, for any digital 
object O C celln(i^’^) and any cell a G R^. Notice that the w.l.f. g induces in R^ 
the (2n, 3^^ — l)-connectedness (see [1, Def. 11]); that is, the generalization to 
arbitrary dimension of the (4, 8)-connectedness on . 

3 A Digital Fundamental Group 

We next introduce an “extrinsic” digital fundamental group that readily gen- 
eralizes to higher digital homotopy groups of arbitrary digital spaces. For this 
purpose, we will first define a digital-map (Def. 2), and then we will focus our 
interest in a special class of such digital-maps termed digital homotopies (Defs. 5 
and 6). 

Definition 1 . Let S C celln{K) be a digital objeet in a digital spaee (AT, /). The 
light body of K shaded with S is the set of eells 

Lb{K/S) = {aeK; f{eelln{K), a) = 1, /(5, o) = 0}; 

that is, Lb{K/S) = {o G K;c{a) G \Ak\ ~ Ids'll- Notiee that if S = 0 is the 
empty objeet then Lb{K/ 0 ) = Lb{K) = {o G K] f {eelln{K),a) = 1}. Moreover, 
Lb{K I eelln{K)) = 0. 

Definition 2 . Let (ATi, /i), (AT2, /2) be two digital spaees, with dimAT^ = ni 
(i = 1 , 2 ^, and let S\ C eelln-^{Ki) and S2 C eelln2{K2) be two digital objeets. 
A map (f : Lb{Ki/Si) Lb{K2/S2) is said to be a (digital) (5i,52)-map from 
(ATi,/i) into (AT 2 ,/ 2 ) (or, simply, a d-map denoted dhsi,S2 • 
provided 

1 . (j){eelln-^{Ki) — 5 i) C eelln2{K2) ~ S2; and, 

2. for a, (3 e Lb{Ki/Si) with a < (3 then <f>{a) < (j){l3). 

Example 2. (1) Let S' G S celln(AT) be two digital objects and {K{S),fs) 
the digital subspace of (AT, /) induced by S. Then, the inclusion Lb(AT(5)/6") C 
Lb(AT/5') is a (5",5')-map from (K{S),fs) into (AT,/). And, similarly, the in- 
clusion Lb(Ar/5") C Lb(AT/0) defines a (5',0)-map from {K,f) into itself. 

(2) Let Si C cellni(ATi) and a G cellos (AT 2 )- For any digital object S2 ^ 
celln 2 (AT 2 ) — {cr}, the constant map : Lb(ATi/*S'i) ^ Lb(AT 2 /*S' 2 ), given by 
(j)^{a) = cr, for all a G Lb(ATi/*S'i), defines a (5'i,52)-map from (ATi,/i) into 

(K 2 j 2 ) . 
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^( 2 , 1 ) ^^ 2 , 1 ) 

Fig. 1 . A ( 2 , l)-window in and its simplicial analogue 




( 3 ) The composition of digital maps is a digital map. Namely, if 

: (Kuh) ^ {K2J2) and : {K2J2) ^ (KsJs) 

are d-maps, then their composite ^Si,S2 ° ^82,83 is also a d-map from (ATi,/i) 
into {K3J3). 

A d-map from (ATi,/i) into (AT2,/2) naturally induces a simplicial map be- 
tween the simplicial analogues of K\ and K2. More precisely, if Li,L2 ^ A are 
simplicial complexes and Li \L2 = {a G Li; afl | L2 | = 0 } denotes the simplicial 
complement of L2 in Li, then it is straightforward to show 

Proposition 1 . Any d-map ^51,52 • (^25/2) induces a simplicial 

map A{^Si,82) • Aki \ As^ Ak 2 \ As2- 

Given two points x = (xi, . . . , x^), y = (^1, . . . , ym) G IR’^, we write x ^ y 
if Xi < yij for all 1 < z < m, while x-\-y will stand for the point (xi +^i, . . . , x^ + 

Vm) e 

Definition 3 . Given two points r, x G , with > 0 for 1 < i < m, we call 
a window of size r (or r-window ) of based at x to the digital subspace 

of (R^^g) induced by the digital object = {cr G cellm{R^)]x ^ c(cr) ^ x + r}, 
where {R^^g) is the digital space defined in Example 1 . 

Notice that the simplicial analogue of an r-window Vfj of R^ is (simplicially 
isomorphic to) a triangulation of a unit n-cube, where n is the number of non-zero 
coordinates in r (see Figure 1 ). Moreover, the set {y G x ^ y ^ x -\- r} are 
the centroids of the cells in Lb(F/^/ 0 ) which actually span the simplicial analogue 
of Vfj . Here Z = \ 7 Z stands for the set of points {z eJR;z = y/ 2 , y G Z,}. 

For the sake of simplicity, we shall write Vr to denote the r-window of R^ 
based at the point x = ( 0 , . . . , 0 ) G Moreover, if Vr is an r-window of 

then Lb(fA/ 0 ) = {<^0, cn, • • • , cT2r-i, cT2r} consists of 2 r + l cells such that c{ai) = 
i/2. 

With this notation we are now ready to give “extrinsic” notions of walks 
and loops, in a digital object, which will lead us to the definition of a digital 
fundamental group. 





Rafael Ayala et al. 



Definition 4. Let {K,f) be a digital spaee and S,0 ^ eelln{K) two disjoint 
digital objeets in (AT,/). A 5-walk in O of length r ^ 7Z from a to t is a 
digital {0jS)-map <pr : Lb{Vr/0) Lb{K{0 U S)/S) sueh that (jrio'o) = cr and 
(f)r{o- 2 r) = T. A 5-loop in O based dit a is a S-walk cjr sueh that 0r(c^o) = 
j)r{p'2r) — 

The juxtaposition of two given S-walks O, with 0r(<^2r) = 

is the S-walk (jr ^ (j)s ' Lb(Vr-\-s/0) Lb{K{0 U S)/S) of length r + s given by 



(j)f ^ 




if0<i<2r 
if2r<i< 2{r + s) 



Notice that the notion of a 5-walk is compatible with the definition of 5- 
path given in [1, Def. 5]. Actually, each 5-walk cjr defines a 5-path given by the 
sequence (p{(j)r) = (0r (<^2i))i=o- And, conversely, a 5-path (T^)f^Q in O can be 
associated with a family dhr of 5-walks such that <f>r ^ if and only if (j)r{o' 2 i) = 
Ti {0 < i < r) and (t)r{cr 2 i-i) ^ < T 2 i -2 H T 2 u f{0 U 5, o) = 1, /(5, a) = 0} 

(1 < i < r). However, this “extrinsic” notion of 5-walk will be more suitable 
to define the digital fundamental group of an object since, together with the 
notion of r-window, it allows us to introduce the following definition of digital 
homotopy. 

Definition 5. Let two S-walks in O of the same length r ^ 7Z from 

cr to T. We say that are digitally homotopic (or, simply, d-homotopic^ 

relative {cr, r}, and we write <j)( rel. (cr, r}, if there exists an (r, s)-window 

in and a {0,S)-map H : Lb{V(^r^s^/0) Lb{K{0 U S)/S), ealled a d- 
homotopy, sueh that H{i/ 2,0) = 0{(cr^) and H{i/2,s) = (j)‘^{ai), for 0 <i < 2r, 
and moreover H{0,j/2) = a and H{r,j/2) = r, for 0 < j < 2s. Here we use the 
identifieation H{x,y) = H{a), where c{a) = {x,y) G Z‘^ is the eentroid of a eell 
aeLb{V^r,s)m. 

Clearly, the previous definition of d-homotopy induces an equivalence relation 
between the 5-walks in O from cr to r of the same length. Moreover, it is easy 
to show that d-homotopies are compatible with the juxtaposition of 5-walks. 

The definition of d-homotopy between 5-walks of the same length can be 
extended to arbitrary 5- walks as follows. 



Definition 6. Let S-walks in O from cr to r of lengths r ^ s. We 

say that (pr 'Is d-homotopie to cps relative {cr, r}, <pr —d (ps ^f there 

exist eonstant S -loops and (p^; sueh that r r' = s -\- s' and <pr * (pj> —d 
(ps * (pl, rel. {cr,r}. 

Proposition 2. Let (pr be a S-walk in O from a to r, and cp^, cpl two eonstant S- 
loops of the same length s. Then, cp^ * <pr —d 4>r * <Pl {cr, r}. 

The proof of this proposition, although it is not immediate, can be directly 
obtained from definitions by means of an inductive argument. Moreover, from 
Proposition 2 and the remarks above, it can be easily derived that d-homotopy 
defines an equivalence relation in the set of 5-walks in O of arbitrary length. So, 
we next introduce the digital fundamental group as follows. 
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Definition 7. Let O he a digital objeet in a digital spaee The digital 

fundamental group of (9 at cr is the set 7rf{0,a) of d-homotopy elasses of 0- 
loops in O based at a provided with the produet operation [cfr] * [f’s] = [4>r * '0s]- 

Remark 1. Definition 7 can be easily extended to the definition of a digital 
fundamental group for the complement of an object 5 in a digital space us- 
ing the notion of 5-loop. Moreover, this last notion readily generalizes to give 
higher digital homotopy groups by replicating the same steps as above but 
starting with a suitable notion of m-dimensional 5-loop. More explicitly, let 
r G be a point with positive coordinates, and call boundary of an r- 

window Vr to the set of cells dVr = {a G Lh(Vr/0);c{a) G Notice 

that the boundary dAvr well-defined since Av^ triangulates the unit m-cube. 
Then define an m-dimensional S-loop in O of size r at cr as any (0, 5)-map 
(j)r : Lb(K/0) ^ Lh{K{0 U 5)/5) such that 0r|av,= cr. 

4 Isomorphism with the Continuous Fundamental Group 

As usual the fundamental group of a topological space A, 7 Ti(A, xq), is defined 
to be the set of homotopy classes of paths ^ : I = [0, 1] X that send 0 and 1 
to some fixed point xq {loops at xq). The set 7 Ti(A, xq) is given the structure of 
a group by the operation [a] • [fd] = [cr * /3], where (3 denotes the juxtaposition 
of paths. However, for a polyhedron | K \ there is an alternative definition of 
the fundamental group 7Ti{\K \ ^xq) that is more convenient for our purposes, 
so we next explain it briefly. Recall that an edge-path in | AT | from a vertex uq 
to a vertex Vn is a sequence a of vertices uq, ui, . . . , such that for each k = 
1, 2, . . . , n the vertices u^_i, span a simplex in K (possibly Vi-i = Vi). If uq = 
Un, cr is called an edge-loop. 

Given another edge-path fd = whose first vertex is the same as the 

last vertex of cr, the juxtaposition fd = is defined in the obvious way. 

The inverse of cr is cr“^ = ('C’n, 'C’n-i, • • • 

Two edge-paths cr and fd are said to be equivalent if one can be obtained 
from the other by a finite sequence of operations of the form: 

(a) if Vk-i = u/c, replace . . . , Vk-i^Vk^ • • • by . . . , . . ., or conversely replace 

. . . ,u/c, . . . by . . . ,u/c_i,u/c, . . .; or 

(b) if u/c_i, u/c, u/c+i span a simplex of K (not necessarily 2-dimensional), replace 
• • . ,u/c_i,u/c,u/c+i, . . . by . . . ,u/c_i,u/c+i, . . ., or conversely. 

This clearly sets up an equivalence relation between edge-paths, and the set 
of equivalence classes [cr] of edge-loops cr in AT, based at a vertex uq, forms a 
group 7Ti(Ar, uo) with respect to the juxtaposition of edge-loops. This group will 
be called the edge-group of AT. Moreover it can be proved 

Theorem 1. (Maunder; 3.3.9) There exists an isomorphism 7Ti(| AT |, uq) ^ 
7ri(AT, uo) whieh earries the elass [/] to the elass [cr/], where af is an edge-loop 
defined by a simplieial approximation of f. 
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Corollary 1. Let 0 ,S be two disjoint digital ohjeets of a digital spaee 
Then 7 Ti(^ous \ c(cr)) = 7 Ti(|^ous I - |,c(cr)) for any a eO. 

The proof of this corollary is a consequence of Theorem 1 and next lemma. 

Lemma 1. Let K^L C J be two full subeomplexes. Then \K\L\ = \ K\K n L\ 
is a strong deformation retraet of \ K\ — \L \ = \ K\ — \ K n L\. 

This lemma is actually Lemma 72.2 in [ 9 ] applied to the full subcomplex 
K n L ^ K (this fact is shown using that L is full in J). 

We are now ready to prove the main result of this paper. Namely, 

Theorem 2. Let O be a digital objeet in the digital spaee (AT,/). Then there 
exists an isomorphism h : 7rf(0, cr) ^ 7 Ti{\Ao |,c(ct)). 

The function h is defined as follows. Let cjr be any 0 -loop in O based at a. 
Then the sequence c{(j)r) = (c(^^(cr^)))f^Q defines and edge-loop in Ao based at 
c(cr), and so we set h{[(j)r]) = [c( 0 ^)]. Lemma 2 below, and the two immediate 
properties (1), (2), show that the function h is a well defined homomorphism of 
groups. 

( 1 ) c{(f)r * (p's) = c{(pr) * c{(p'^) 

(2) if (fr is a constant 0-loop then c{(j)r) is also a constant edge-loop. 

Lemma 2. If (jr —d 4>'s equivalent ^-loops then c{<f>r) ci'^d c{<j)'^) define both 
the same element in tvi{\ Ao |,c(cr)). 

Proof According to (2) above and the definition of equivalence between two 
0 -loops it will be enough to show that any d-homotopy H : Lb(V(^^5)/0) ^ 
Lb(iT(O)/ 0 ) between two 0 -loops 0 ^, cj)'^ of the same length r induces a contin- 
uous homotopy H : [ 0 , 1 ] x [ 0 , 1 ] ^ \Ao \ between c( 0 ^) and c( 0 ^). This fact is 
readily checked since Proposition 1 yields a simplicial map A{H) : Ao^ 

and v 4 v(^ is (simplicially isomorphic to) a triangulation of the unit square (see 
Figure 1 ). Moreover A{H) restricted to the top and the bottom of that unit 
square define c{(j)r) and c(0^) respectively, and the result follows. 

Now, let 7 be any loop in | Ao \ based at c{a). By Corollary 1 we can assume 
that 7 = is an edge-loop based at c{a) in Ao- After applying equiv- 

alence operations (a) and (b) above we can reduce the edge-loop 7 to a new 
edge-loop 7 = (c(7j)fko equivalent to 7 and such that ^2i-i /c-cells in K 
with k < n and ^2i-i — I21-2 ^ l2i (1 ^ ^ ^ By the use of 7 we define the 
following set ^(7) of 0-loops at a of length r. 

The set ^(7) consists of all 0 -loops (fr for which 0 r(<^o) = f>{(^2r) = cr, 
(j)r{(^2i-i) = l2i-i (1 ^ ^ ^ r*) and (j)r{(^2i) ^ stn(72d (0 ^ ^ ^ Noticc that 
stn(72d^) “ {l2i} if only if 72^ G O, while stn(72d^) contains at least 
two elements otherwise. This is clear since 0(72 J G Ao yields 72^ G supp(O) 
by Axiom 2 of w.l.f.’s. Notice that ^(7) = {7} if and only if 72^ G O for all 
0 < z < r; and moreover, in any case, F{^) 7^ 0 is a non-empty set. 
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Lemma 3. For each <pr G F{^), c{(j)r) is homotopic to the loop 7. Hence h is 
onto. 

Proof. Let us consider the set F{j) of edge-loops a = at c{a) such 

that ao = a2r = c(cr), 0^22-1 = 72 i-i (1 < ^ < ^) and 0^22 G stn( 72 d^) ^ 
{l2i}' Notice that <pr ^ ^(7)} U {7} C F{^). Since 7 was obtained from 

7 by transformations of types (a) and (b), they are equivalent edge-loops by 
Theorem 1 . So, it will suffice to show that a is homotopic to 7, for any a G F{'j). 
This will be done by induction on the number t{a) of vertices a2i 7^ 72^ in <a* 
For t{a) = 0 we get a = 7. Assume that a G F{j) is equivalent to 7 if 
t{a) < t — 1 . Then, for an edge-loop a G F{j) with t{a) = t let a2i any vertex 
in a such that a2i 7^ 72^- Then we have ^2i-\-i^^2i-i ^ 72 i ^ and we obtain 
a new edge-loop a G ^(7), replacing a2i by j2i i^ with t{a) = t — 1 and 
two equivalence transformations of type (b) relating a and a. Hence a is an 
edge-loop equivalent to a and, by induction hypothesis, to 7. 

Lemma 4. Any two 9 -loops in F{'j) are d-homotopic. 

Proof. It is enough to observe that the map H : Lb(V(^^i)/ 0 ) ^ Lb(iT(O)/ 0 ) 
given by H{i/ 2 , 0 ) = 0 j(cr^), H{i/ 2 , 1 ) = (j)l{cri) and TT(z/2,l/2) = 7^, for 0 < 
i < 2 r, and H{i — 1 / 2 , k) = 72^-1, for 1 < i < 2 r and k G { 0 , 1 / 2 , 1 }, is a d- 
homotopy relating and (jf^. Here we use again the identification H{x^y) = 
H{a)^ where c{a) = {x^y) G is the centroid of a cell a G Lb(H7^i)/0). 

Lemma 5. Let 7 ^ and 7 ^ he two edge-loops at c{a) in Ao such that they are 
related by an equivalence transformation of type (a) or (h). Then there exist 
9 -loops (/)*. G F{Y) (i = 1 , 2 / and a d-homotopy such that (Pr2 

Hence h is injective. 

Proof. In case 7^ is related to 7^ by a transformation of type (a), it is readily 
checked that 7^ =7^- Hence ^(7^) = ^(7^) and the result follows. And it 
suffices to check the essentially distinct twelve ways for deriving 7^ from 7^ by 
transformations of type (b) to complete the proof. 

5 A Digital Seifert— Van Kampen Theorem 

The Seifert-Van Kampen Theorem is the basic tool for computing the funda- 
mental group of a space which is built of pieces whose groups are known. The 
statement of the theorem involves the notion of push-out of groups, so we begin 
by explaining this bit of algebra. A group G is said to be the push-out of the 
solid arrow commutative diagram 
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if for any group H and homomorphisms (f 2 with (fifi = 9 ^ 2/2 there exists a 
unique homomorphism (p such that (pfi = (pi {i = 1,2). Then, the Seifert-Van 
Kampen Theorem is the following 

Theorem 3. (Th. 7.40 in [10]) Let K he a simplieial eomplex having eonneeted 
suheomplexes K\ and K 2 sueh that K = K 1 UK 2 and Kq = KiHK 2 is eonneeted. 
If Vo G Kq is a vertex then 7 Ti{K,vq) is the push-out of the diagram 



(ATo, '^ 0 ) > 7Ti (ATi, i;o) 




7Ti(A:2,^o) 






^ TTi{K,Vo) 



where ik^ and jk^ are the homomorphisms of groups indueed the obvious inelu- 
sions. 

By using explicit presentations of the groups 7Ti{Ki,vo) (i = 0,1,2) the 
Seifert-Van Kampen Theorem can be restated as follows. Suppose there are pre- 
sentations 'Ki{Ki,vo) = .. .',r\,r 2^ .. .) (i = 0, 1, 2) Then the fundamental 

group of K has the presentation 

'Ki{K,Vq) = 

r\,r\,. . . . . ,ii^{x\) = i2*{xi),ii*{x2) = i2*{x2), ■ ■ •)• 

In other words, one puts together the generators and relations from 7Ti{Ki,vo) 
and 7Ti{K2,vo), plus one relation for each generator of 7Ti{Ko,vo) which says 
that its images in 7Ti{Ki,vq) and 7ri(AT2, t’o) are equal. 

The digital analogue of the Seifert-Van Kampen Theorem is not always true 
as the following example shows. 

Example 3. Let 0\,02 be the two digital objets in the digital space {R^,g) 
shown in Figure 2. It is readily checked that both 7rf(Oi, cr) and 7 rf(( 92 ,cr) are 
trivial groups, but 7rf(Oi U 02,cr) = Zi despite of 0\, O 2 and 0\ n O 2 are 
connected digital objects. 

However we can easily derive a Digital Seifert-Van Kampen Theorem for 
certain objects in a quite large class of digital spaces. Namely, the locally strong 
digital spaces; that is, the digital spaces (AT, /) for which the lighting function / 
satisfies f{0,a) = f{stn{oi;0),a). We point out that all the (a, /3)- connected 
digital spaces on defined within the graph-theoretical approach to Digital 
Topology, for a, f] e {6, 18, 26}, are examples of locally strong digital spaces; see 
[1, Example 2]. 

Theorem 4. (Digital Seifert-Van Kampen Theorem) Let (AT, /) be a loeally 
strong digital spaee, and let O C eelln{K) be a digital objeet in (K,f) sueh that 
O = O 1 UO 2 , where O 2 and 0in02 are eonneeted digital objeets. Assume in 
addition that Aoin 02 ^ Aoi n A 02 Aoi ^ Ao (i = 1, 2}. Moreover assume 
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that for each cell a e Oi — O2 any cell r e O which is adjacent to a in O lies 
in Oi. Then, for cr G Oi H O2, Trf (O, cr) is the push-out of the diagram 

■nfiOi n02,o-) > 7rf(Oi,o-) 



7Ti(02,a) > 7rf(0,a) 

where the homomorphisms are induced by the obvious inclusions. 

The proof of this theorem is immediate consequence of Theorem 2 and 
Theorem 3 if we have at hand the equalities \Aoxn02 I = \^Oi \ H \Aq2 I and 
I Ao I = I Aoi I U I A02 I • We devote the rest of this section to check these equali- 
ties. 

Lemma 6 . If f{ 0 , a) = 1 then one of the following statements holds: 

( 1 ) stn{a; 0 ) = stn{a;Oi n O2) = stn{a;Oi) = stn{a;02); or 

( 2 ) stn{a; 0 ) = stn{a;Oi) and stn{a;Oj) = stn{a;Oi n O2), {ij} = { 1 , 2 }. 

Proof. In case stn(<a; O) = stn(<a; 0 \ H O2), we obtain (1) from the inclusions 

stn{a; Oi n O 2 ) C stn{a; Oi) C stn{a; O), (z = 1, 2). 

Otherwise, there exists a G O — (Oi H O2) with a < a. Assume cr G Oi — O2, 
then for all r G stn(cr; O) we have r G Oi by hypothesis and hence st^(cr; O) = 
stn(<a;Oi). Moreover stn(<a;02) ^ stn(<a; 0 ) = stn(<a;Oi) yields st^(cr;( 9 i H 
O2) = stn{a; O2) C stn(<a; O). 

The case cr g O2 — Oi is similar since then r G O2 (r ^ O2 yields r G Oi — O2 
and hence cr G Oi by hypothesis). 

Lemma 7 . Aoi n A02 ^ Aoin02 Ao ^ Ao-^ U Mos- so the equalities 
follow by hypothesis. 

Proof. Let c{a) G Aoi 2 ]Ao 2 ^ then f{Oi, cr) = 1 for i = 1 , 2 and hence stn(cr; OiH 
O2) = stn(cr; Oi) for some i by Lemma 6. Thus f{Oi n O2, cr) = 1 by the strong 
local condition of /, and so c(cr) G Moin02- Finally Aoi H A02 ^ Aoin02 since 
Aoin02 is a full subcomplex. 

Now let 7 = (c(7o), . • • ,c(7/c)) G Mo- Then stn( 7 /c; O) C stn{jk-i;0) C • • • C 
stn(7o; O). By Lemma 6 and the strong local condition we easily obtain 7 G Aoi 
whenever stn(7o; O) = stn(7o; Oi) {i = 1 , 2 ). 

6 Future Work 

The Digital Seifert-Van Kampen Theorem provides us a theoretical tool that, 
under certain conditions, computes the digital fundamental group of an object. 
Nevertheless, the effective computation of the digital fundamental group requires 
an algorithm to compute a presentation of this group directly at the object’s 
logical level. In a near future we will intend to develop such an algorithm for 
general digital spaces, as well as to compare the digital fundamental group in 
Def. 7 with those already introduced by Khalimsky [ 3 ] and Kong [ 4 ]. 
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A01UO2 



Fig. 2. A digital object for which the Digital Seifert-Van Kampen Theorem does 
not hold 
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Abstract. The watershed transformation is a powerful tool for segment- 
ing images, but its precise definition in discrete spaces raises difficult 
problems. We propose a new approach in the framework of orders. We 
introduce the tesselation by connection, which is a transformation that 
preserves the connectivity, and can be implemented by a parallel algo- 
rithm. We prove that this transformation possesses good geometrical 
properties. The extension of this transformation to weighted orders may 
be seen as a generalization of the watershed transformation. 

Keywords: discrete topology, order, discrete distance, influence zones, 
watershed 



1 Introduction 

The watershed transformation [ 1 ] is a powerful tool for segmenting images. Ini- 
tially introduced in the field of topography, the notion of watershed is often 
described in terms of steepest slope paths, watercourses and catchment basins. 
Here we prefer the following presentation, which is precise enough to be imple- 
mented by a computer program. 

Gonsider a grayscale image as a topographic relief: the gray level of each point 
corresponds to its altitude. Watersheds may be obtained by piercing a hole at 
each minimum of this relief, and immersing the relief into a lake. The water will 
progressively hll up the different basins around each minimum. When the waters 
coming from different minima are going to merge, a dam is built to prevent the 
merging. At the end of this process, the set of points immersed by the water 
coming from one minimum rrii is called the catchment basin associated to m^, 
and the points that have not been immersed, and where dams have been built, 
constitute the watersheds. 

This transformation may be seen as a set transformation, guided by the gray 
levels, which transforms the set of the points belonging to minima into the set 
of the points belonging to catchment basins. This set transformation has an 
important property: it preserves the connected components of the set (Fig. 1 ). 
On the other hand, the connected components of the complementary set are not 
preserved, as shown in Fig. 1(62,02). It is important to note that a topology- 
preserving transformation, such as an homotopic kernel [2] (which may be seen 
as an “ultimate skeleton”), should preserve both the connected components of 
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the set and those of the complementary set (see Fig. and 62 , 0 ^ 2 )), and in 

3D, it should also preserve the tunnels of both the set and the complementary 
set. 
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Fig. 1. (ai): original images; ( 6 ^): minima of ai (in white); (q): catchment basins 
of ai (in white); di\ an upper homotopic kernel of bi (in white) 

Many definitions and algorithms for the watershed transformation have been 
proposed [3, 4, 5, 6 , 7]. Particularly difficult is the problem of correctly specifying 
the watersheds that are located in plateaus, especially in the discrete spaces. Sev- 
eral authors have used a notion of distance in order to impose a “good centering” 
of the watersheds in plateaus, but in the usual discrete grids this centering is not 
always perfect. Their approach is based on the notion of influence zones: to a 
subset X of which is composed of k connected components Xi, . . . , Xk, we 
can associate the influence zones Vi , . . . , such that a point x belongs to 4^ if x 
is nearer from Xi than from any other component Xj of X. This constitutes a 
set transformation, which does not preserve any topological characteristic (as de- 
fined in the framework of digital topology [ 8 ] ) , not even the number of connected 
components: see Fig. 2. 
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Fig. 2. Two regions (in black) and their respective influence zones (shaded) for 
the usual 4-distance. We see that the union of the two influence zones is 4- 
connected, while the original set (black points) is not 4-connected 



This paper extends a work presented in [9], in the framework of graphs. We 
propose a new approach based on the notion of order [10]. An order is equiva- 
lent to a discrete topological space (in the sense of Alexandroff [11]). In such a 
space, we prove that the influence zones transformation, defined thanks to the 
“natural” distance and applied to a closed set, preserves the connected compo- 
nents. We introduce the notion of uniconnected point, which allows to define a 
set transformation that preserves the connected components of the set: the tes- 
selation by connection. From this set transformation, we derive a transformation 




Tesselations by Connection in Orders 



17 



on grayscale images, and more generally on weighted orders, that can be seen 
as a generalization of a watershed transformation. We propose some parallel al- 
gorithms to implement such transformations in orders and weighted orders. We 
also show that the link between these transformations and the influence zones 
transformation ensures that our algorithms produce a “well centered” result. In 
this paper, we give some examples based on orders that “modelize” the discrete 
grid Z'^ . Nevertheless, we emphasize the fact that all the presented properties 
and algorithms are valid for any order. 

2 Basic Notions 

In this section, we introduce some basic notions relative to orders (see also [10]). 

If X is a set, V{X) denotes the set composed of all subsets of X, if *5^ is a 
subset of X, aS' denotes the complement of *5^ in X. If 5 is a subset of T, we 
write 5^ C T, the notation S d T means S d T and S 7 ^ T. If 7 is a map 
from V{X) to V{X), the dual of j is the map *7 from V{X) to V{X) such 
that, for each 5 C X, *7(5) = j{S). Let be a binary relation on X, z.e., a 
subset of X X X. We also denote by 6 the map from X to V{X) such that, for 
each X of X, S(x) = c X, (x,y) c (^}. We define S° as the binary relation 
= (^ \ {(x, x); X c X}. 

An order is a pair |X| = (X, <a) where X is a set and a is a reflexive, anti- 
symmetric, and transitive binary relation on X. An element of X is also called 
a point. The set a{x) is called the a- adherence of x, if y G a{x) we say that y is 
a-adherent to x. 

Let (X, (a) be an order. We denote by a the map from V{X) to V{X) such 
that, for each subset S of X, a{S) = U{<a(x); x c 5}, a{S) is called the a- 
closure of 5, ^a(S) is called the a-interior of S. A subset 5 of X is a-closed if 
S = (a(5), S is a-open if S = ^a(S). 

Let (X, a) be an order. We denote by f3 the relation f3 = {(x, y); (^, x) G a}, 
j3 is the inverse of the relation a. We denote by 6 the relation 6 = ad [3. The 
dual of the order (X, a) is the order (X, (3). 

Note that *(a(5) = {x G 5; j3{x) d 5}, and ^(3{S) = {x d S] a{x) d S}. 

The set Oo, composed of all o;-open subsets of X satisfies the conditions for 
the family of open subsets of a topology, the same result holds for the set Oy 
composed of all /3-open subsets of X; we denote respectively by 7^ = (X, Oq.) 
and by 7^ = (X, Oy) these two topologies. These topologies are Alexandroff 
topologies, z.e., topologies such that every intersection of open sets is open [ 11 ]. 

An order (X, a) is countable if X is countable, it is locally finite if, for each 
X G X, is a finite set. A CF- order is a countable locally finite order. 

If (X, (a) is an order and *5^ is a subset of X, the order relative to S is the 
order |5| = {S,an{Sx S)). 

Let (X, a) be a CF-order. Let xq and Xk be two points of X. A path from xq 
to Xk is a sequence xo,xi, ...,x/c of elements of X such that Xi G 6{xi-i)^ with 
i = l,...,/c. The number k is called the length of the path. We consider the 
relation {{x,y)] there is a path from x to y}. It is an equivalence relation, its 
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equivalence classes are called the connected components of X. We say that (X, a) 
is connected if it has exactly one connected component. 

Let |X| = {X,a) be a CF-order. The a-rank of a point x is the length of a 
longest <a-path having x as origin. It is denoted by r(x). 

Let (X, (a) be an order. An element x such that a^(x) = 0 (z.e., such that 
r{x) = 0) is said to be a-terminal (for X ). 

3 Orders Associated to 2 ^ 

We give now a presentation of some orders which may be associated to Z'^ [12,10]. 

Let Z be the set of integers. We consider the families of sets LTq, 
such that, = {{a}; a G X}, Hi = {{a, a + 1}; a G X}, U Hi 

A subset S of Z'^ which is the Cartesian product of exactly m elements of Hi 
and (n — m) elements of H^ is called a m-cube of Z^. We denote H^ the set 
composed of all m-cubes of m = 0, ..., n. 

An m-cuhe of Z^ is called a singleton if m = 0, a unit interval if m = 1, a unit 
square if m = 2, a unit cube if m = 3. 

In this paper, the basic order associated to Z^ is the order where 

a thus y G a{x) \i x ^ y. In Fig. 3(a), an example of a subset S of H‘^ is 
given. The object S is made of two connected components S\ (to the left) and S 2 
(to the right). It may be seen that S\ contains one singleton (a-terminal, a-rank 
0), two unit intervals (a-rank 1), and two unit squares (/3-terminals, a-rank 2). 
In Fig. 3(b), an alternative representation of the same object is presented, we 
call it the array representation. We use the following conventions: a singleton is 
depicted by a circle (o), a unit interval by a rectangle (^), and a unit square 
by a square ( □ ) . 
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In order to build consistent topological notions for a subset S of Z^^ we 
associate to 5 a subset ^{S) of H^] thus we recover the structure of a (dis- 
crete) topological space by considering the order (iL’^,G). In this paper, the 
transformation is chosen in such a way that the induced topological notions 
may be seen as “compatible” with the notions derived from the digital topology 
framework. A natural idea for defining is to consider “hit or miss” transfor- 
mations [13]. Thus we consider the set composed of all elements of H^ which 
have a non-empty intersection with 5. In a dual way, we consider the set 
composed of all elements of H^ which are included in S: 

= {x e H^.xns ^0} ; = {x e H^,x c S} 
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Fig. 4(a) presents an object S of Z‘^ (in black). The adjacency used for the 
object is denoted by a, the adjacency used for the background is denoted by 
a (see [8]). Note that if (a, a) = (4,8), then S is composed of two connected 
components and S is connected. On the other hand if (a, a) = (8,4), then S 
is connected and S is composed of two connected components. The images 
and S'^ are shown in Fig. 4(b) and (c) respectively. We note that in we 
retrieve the connectivity that corresponds to (8,4), and in S'^ we retrieve the 
connectivity that corresponds to (4,8). For a more complete discussion about a 
model of the digital topology notions in the framework of orders, see [14]. 

Note also that is always an a-open set (z.e., a /3-closed set), and that S'^ 
is always an a-closed set. 



4 Distance and Influence Zones 

Let (X, (a, j3) be a CF-order, let x and y be two points of X. Let tt = (xq, . . . , x/^), 
with X = xo, ^ = x/c, be a path from x to y. The length of tt is denoted by /(tt) 
and is equal to k. We denote by IIx,y the set of all the paths from x to and 
we define d{x^y) = min{/(7r),7r G 47cc,y}, the distance between x and y. A path 
(j from X to ^ such that /(cr) = <i(x, y) is a shortest path from x to y. 

It may be easily checked that is a (discrete) distance on X, that is, a map 
from XxX to JV that verifies the properties of symmetry, separation (<i(x, y) = 0 
implies x = y) and triangular inequality (<i(x, y) < d{x, z) + d{z^ y)). 

If T is a subset of X, we define d{x,Y) = mm{d{x,y),y G T}, and call this 
value the distance between x and Y. A path a from x to a point y ^Y such that 
I {a) = d(x, Y) is called a shortest path from x to Y. 

Let be a subset of X, and let {i?i, . . . , Rm} be the connected components 
of R. For each z = 1, . . . , m, we define the influence zone Vi associated to Rp. 
Vi = {x e X/Vj G [1, . . . ,m], j 7 ^ i,d{x,Ri) < d{x,Rj)}. 

Ideally, the influence zones should possess the following, desirable properties: 
first, they should be connected sets, and second, they should be mutually discon- 
nected, i.e. any union of two different influence zones should be a non-connected 
set. If we consider the discrete plane and the classical 4- or 8-distance, the 
property of mutual disconnection is not verified (see Fig. 2). 

Here, we prove that the influence zones associated to a family of a-closed 
or /3-closed subsets of a CF-order possess these two fundamental properties. We 
have to prove first these two intermediate lemmas: 
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Lemma 1 Let (X, o;, (5) he a CF-order, and let tt = (xq, . . . , x^) be a path. If tt 
is a shortest path from xq to x^, then Vi = 0, . . . , /c — 2 we have: 

Xij^i G a°{xi) => Xi ^2 e (3°{xi^i), and G (3°{xi) => x ^+2 ^ a°{xi^i). 

Proof: clearly, Vi = 0, . . . , /c — 1; X{ jtz Suppose that G a°{xi) 

and Xi -^2 ^ (^i+i)- We know that tt is a path, hence we have x ^+2 ^ (^i+i)- 

Thus, Xi -^2 ^ and we see that the path (xq, . . . , x^+ 2 , • • • , x/c) is shorter 

than 7T, a contradiction. □ 

Lemma 2 Lei {X,a,j3) he a CF-order, let R he an a-elosed subset of X, let x 
he a point of X \R. Then, there exists a shortest path tt from x to R sueh that 
the last point y of tt is an a-terminal. 

Proof: let tt be a shortest path from x to R, let w,y be the two last points of 
TT {y G R). We have y G a{w), because if we suppose w G a{y), then w belongs 
to R {R is (a-closed), which contradicts the hypothesis “tt is a shortest path 
from X to i?” . 

Suppose that y is not an a-terminal, i.e. a{y) jtz {j}. As R is a-closed, we 
have a{y) C R and a{y) contains a-terminals, let y' be one of them. We have 
y G a(w) and y' G a{y), thus y' G a{w). Let us consider the path tt' identical to 
TT except that y is replaced by y'\ it is also a shortest path from x to R, and its 
last point is an a-terminal. □ 

We are now ready to prove the aforementioned properties: 

Property 3 Let {X, a, (3) be a CF-order, let R be a subset of X, let R\, . . . , Rm 
be the eonneeted eomponents of R, and letVi, . . . , Vm be the influenee zones asso- 
eiated to R\, , Rm, respeetively. For eaeh i = 1, . . . ,m, the set Vi is eonneeted. 

Proof: we shall prove that for any x in a shortest path between x and Ri 
is entirely included in Vi . The property follows immediately from this result and 
from the connectedness of Ri. 

Let X G Vi. Following the definition of influence zones, there exists a short- 
est path TT from x to Ri such that /(tt) = d{x,Ri) and /(tt) < d{x, Rj),\/j G 
[1, . . . , m], j ^ i. Let y be any point in tt, and suppose that y ^ Vi, which means 
that there is a /c G [1, . . . , m], k ^ i, such that d{y, Rk) < d{y, Ri). 

As TT is a shortest path from x to Ri, the subpath tt' of tt from y to Ri is a 
shortest path from y to Ri, hence 1{tt') = d{y,Ri). Also, the subpath tt" of tt 
from X to ^ is a shortest path from x to y, hence 1{tt") = d{x, y). 

Let cr be a shortest path from y to Rk, we have l{a) = d{y,Rk) and hence 
l(o') ^ Then using the triangular inequality: d{x,Rk) < + l{(j) < 

1{tt") -f- 1{tt'), that is d{x,Rk) < d{x,Ri), a contradiction. □ 

Property 4 Let (X,a,(3) be a CF-order, let R be an a-elosed subset of X, 
let R\, . . . , Rm be the eonneeted eomponents of R, and let V\, . . . , Vm be the in- 
fluenee zones assoeiated to Ri, ... , Rm, respeetively. Then, the Vi ^s are mutually 
diseonneeted, i.e. \/i,j G [1, . . . ,m],i j,ViVJVj is not eonneeted. 




Tesselations by Connection in Orders 



21 



Proof: suppose that there is an x in Vi and a ^ in such that x C 0{y). 
We shall: a. prove that d(x^Ri) = d{y^Rj), and b. raise a contradiction using 
lemmas 1 and 2. 

a. Suppose that d{x,Ri) < d{y,Rj). Let tt be a shortest path from Ri to x, 
and let cr be a shortest path from Rj to y^ we have: /(tt) = d(x^ Ri)^l{a) = 
d{y, Rj). The sequence ny is a path from Ri to y, and we have l{7ry) = /(tt) + 1, 
hence l{7ry) < 1 {(t). But d{y^Ri) < l{7ry) (by definition of d) , hence d{y^Ri) < 
d{y^ Rj)^ a contradiction with the fact that y e Vj. A symmetric argument shows 
that d{x,Ri) > d{y^Rj) is also false, hence d{x^Ri) = d{y^Rj). 

b. Let 7T be a shortest path from Ri to x, begining by an a-terminal (see 
lemma 2) and ending by the sequence (x',x). Let cr be a shortest path from Rj 
to y, begining by an cr-terminal and ending by the sequence (y'^y). These two 
paths having the same length (see a.), and beginning by cr-terminals, we deduce 
from lemma 1: x C a(x') y C OL^y'). Our hypothesis x C 0{y) implies either 
X C OL^y) or y G a{x). For example, if x G a{y) and y G <a(^'), then we have 
X G <a(^'), hence l{ax) = l{a) = /(tt). This implies that d{x^Rj) < d{x,Ri), a 
contradiction with the fact that x G There are three other possibilities, that 
lead to a contradiction in similar ways. □ 
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Fig. 5. Examples of families of cr-closed (a,d) or /3-closed (b,c) sets (in black), 
and their influence zones (in gray) 
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5 Tesselation by Connection 



In Fig. 5, we show some examples of families of cr-closed (Fig. 5(a), in black) and 
/5-closed (Fig. 5(b), in black) subsets of and their influence zones (in gray). 
In these cases, the complementary of the influence zones is “thin” ; nevertheless, 
we cannot guarantee such a property in the general case. Fig. 5(c,d) shows 
counter-examples. In many image analysis applications, we need “thin” frontiers 
between the influence zones, this motivates the introduction of the following 
notions. Intuitively, a point is uniconnected if its addition preserves the connected 
components, in other terms, it preserves a connection in the sense of [15]. 

Let (X,a,f3) be a finite CF-order, let C X, and let x G R. The point x 
is uniconnected (for R) if the number of connected components of R equals the 
number of connected components of R U {x}. The point x is a -uniconnected 
(for R) if X is uniconnected for R and if a(x) n R ^ 9. 

We can easily see that a point x ^ R is uniconnected if and only if 6{x) 
intersects exactly one connected component of R. A point x ^ R is said to 
be multiconnected (for R) if 0{x) intersects at least two different connected 
components of R. It is isolated (for R) if 6{x) n R = 0. 
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Fig. 6. (a): xi is uniconnected, X 2 is multiconnected; (b): labeling the compo- 
nents makes the local checking possible; (c): uniconnected points (in gray) cannot 
be added in parallel without changing the number of connected components 



In Fig. 6, we can see that the information contained in d(x) is not sufficient to 
check whether a point x is uniconnected or multiconnected. On the other hand, 
if we assume that the points of each connected component of R are labeled with 
an index which represents this component, then we can check whether a point x 
is uniconnected or not by counting the number of different indexes carried by 
the points in d(x) (see Fig. 6(b)). Furthermore, a uniconnected point x is a- 
uniconnected if a(x) contains at least one labeled point. 

Let R C X, we say that T C X is a thickening by connection of Rif T may 
be derived from R by iterative addition of uniconnected points. We say that T 
is a tesselation by connection of if T is a thickening by connection of R and if 
all the points of X \ T are not uniconnected. 

In general, there are several tesselations by connection for a set R. This is 
due to the iterative nature of the definition: depending on the order of selection 
of the uniconnected points, one can get different results. Nevertheless, in many 
applications we want to obtain a “well centered” result, that could be uniquely 
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defined. This is why we introduce, by the way of a parallel algorithm, a particular 
tesselation by connection that possesses good geometrical properties. 

We can see in Fig. 6(c) that, in general, uniconnected points cannot be added 
in parallel to a set R without changing the number of connected components of R. 
In fact, (a-uniconnected points can indeed be added in parallel to R without 
changing the number of connected components of R. This can be proved by 
induction thanks to the following property. 

Property 5 Let (X, o,/3) be a CF-order, let R C X, and let x,y e R be two 
a-unieonneeted points for R. Then, x is a-unieonneeted for R U {y}. 

Proof: if ^ ^ then the property is obvious. If ^ C a{x) or x C <a(^), 

then the component of R which is o-adherent to y is clearly the same as the 
component of R which is o-adherent to x. Thus in both cases, adding the point y 
to R does not change the fact that x is o;-uniconnected.n 

The following algorithm computes a tesselation by connection of a closed 
subset of X. 

Algorithm 1 

Input data: 

(X,a,j3), a finite CF-order, 

i?i, . . . , Rm, the m eonneeted eomponents of an a-elosed subset R of X 
Output: a tesselation by eonneetion T assoeiated to R 
Initialization: 

:= R; := 0; n := 1; label the points of eaeh Ri with the index i 
Repeat until stability: 

Compute the set of the points that are unieonneeted for T'^~^ , 

and the set of the points that are multieonneeted for T'^~^ 

(this ean easily be done using the indexes) 

Label the points x of with the index of the eomponent found in 6{x) 

Label the points y of U'^ with the index 0 
Tu := U := B^~^ UU^ ; n := n -h 1 

End Repeat 

._ 'jpn 

It can be easily seen that for any even (resp. odd) value of n, the set 
is o-closed (resp. /3-closed) in \X \ B^\. From this, it follows that is only 

composed of o-uniconnected (resp. /3-uniconnected) points if n is even (resp. 
odd). Thus (Prop. 5), the result T of this algorithm is a thickening by connec- 
tion of the original o-closed set R. In addition, at the end of this algorithm no 
uniconnected point remains. Thus, the result T is a tesselation by connection of 
the original set R. 

The following property makes a link between the tesselation by connection 
computed by algorithm 1 an the influence zones, ensuring that this tesselation 
by connection is “well centered”. 

The following lemma and the following property derive from the fact that 
the labeling process is guided by a breadth-first strategy. 
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Lemma 6 Every point x that receives an index during the step n of the algo- 
rithm^ is such that d{x,R) = n. 

Every point x that receives an index z > 0 during the step n of the algorithm, is 
such that d{x, Ri) = n. 

Property 7 Let {X,a,f3) be a CE-order, let R\, . . . ,Rm he the m connected 
components of an a-closed subset R of X , let , Vm be the influence zones 

associated to R\, , Rm, respectively, and let ... , Tm be the connected com- 

ponents of the tesselation by connection of R computed by algorithm 1. Then we 
have : Mi = 1, . . . ,m, Vi C T^. 

This property establishes that the influence zones are included in the com- 
ponents of the tesselation by connection computed by algorithm 1. The converse 
is true only in some cases, like in Fig. 5(a,b,d); but it is not true in general, as 
shown by the counter-examples of Fig. 7. 
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Fig. 7. (a,b): some objects (in black) and their influence zones (in gray). (a’,b’): 
the same objects (in black) and their tesselation by connection (in gray) 



6 Tesselation by Connection for Weighted Orders 



In this section, we extend the notions of uniconnected point and tesselation 
by connection to weighted orders. A tesselation by connection for weighted or- 
ders may be considered as a generalization of the watershed transformation [1]: 
the result of the proposed transformation is a function, whereas the result of a 
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watershed transformation is a set. We recover a set which corresponds to the 
complementary of the watersheds, by extracting the regional maxima of the 
tesselation by connection. 

Let \X\ = [X^a^(3) be a finite CF-order, and let W be a mapping from X 
to Z. The couple {\X\, W) is called a weighted order on \X\. For applications to 
digital image processing, W (x) typically represents the graylevel of the point x. 
We denote by FF/c, and call the eross-seetion ofW at level /c, the set Wk = {x c 
X, W{x) > k} with k G Z. The weighted order (|X|, W) is a-elosed if, \/k c X, 
the order \Wk\ is a-closed. 

Let (|X|, FF) be a weighted order, and let x c X. The point x is unieonneeted 
for FF if X is unieonneeted for FF/^+i, with k = FF(x). 

Let (|X|, FF) and (|X|, M) be weighted orders, we say that M is a thiekening 
by eonneetion of FF if M may be derived from FF by iteratively selecting a 
unieonneeted point x and raising its value FF(x) by one. We say that M is a 
tesselation by eonneetion of FF if M is a thickening by connection of FF and if 
all the points of X are not unieonneeted for M . 

Algorithm 2 

Input data: (|X|,FF), a finite a-elosed weighted order, 

Output: a tesselation by eonneetion M assoeiated to FF. 

Initialization: Let L = (ko, . . . ,ks) the list of values taken by W{x), 
sorted in inereasing order. 

For all i from 1 to s 

Compute the tesselation by eonneetion Z of FF/^. 

in the order indueed by FF/c-_^, using algorithm 1 
For all z ^ Z \ FF/c-_^ do W[z] := ki; End For 
End For 
M :=W 

The following transformations (introduced in [16]) may be used to construct 
a closed weighted order from a grayscale image. 

Let T be the set of functions from Z'^ to M . Let W be the set of functions 
from to J\f. We define the transformations and from X to VF which 
associate to each F in X the functions FF^ and FF"^ respectively, defined by: 

Vx G W^{x) = min{F{y), y G {y} G a{x)} 

Vx G FF^(x) = max{F(^), y G Z^, {y} G a(x)} 

It can be easily seen that (|X|, FF"^) is an o-closed weighted order, and that 
(|X|,FF^) is a /3-closed weighted order. In Fig. 8, we show the weighted order 
obtained by applying F^ to a “real” image, and the tesselation by connection 
of this weighted order computed by algorithm 2. 
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Abstract. We recall the definition of simple points which uses the dig- 
ital fundamental group introduced by T. Y. Kong in [Kon89]. Then, we 
prove that a not less restrictive definition can be given. Indeed, we prove 
that there is no need of considering the fundamental group of the com- 
plement of an object in order to characterize its simple points. In order 
to prove this result, we do not use the fact that “the number of holes 
of X is equal to the number of holes in X” which is not sufficient for our 
purpose but we use the linking number defined in [FMOO]. In so doing, 
we formalize the proofs of several results stated without proof in the 
literature (Bertrand, Kong, Morgenthaler) . 

Keywords: digital topology, linking number, simple points 

1 Introduction 

The definition of a simple point is the key notion in the context of thinning 
algorithms. Indeed, this definition leads to the most commonly admitted defini- 
tion of the fact that a given thinning algorithm does preserve the topology of a 
digital image. Usually, one says that an image Xi is topologically equivalent to 
an image X 2 if X\ can be obtained from X 2 by sequential addition or deletion of 
simple points. In this case, a simple point is defined as a point the deletion of 
which does not change the topology of the image. The problem with topology 
preservation in 3D is that taking care not to change the number of connected 
components in the image as well as in its bakground is not sufficient as in 2D. 
In 3D, one must also take care not to change the number and the location of 
the tunnels as donuts have. Now, different characterizations have been proposed 
by several authors which all lead to local characterizations which are equivalent. 
A first set of characterizations mainly use the Euler characteristic in order to 
count the number of tunnels, but even if this kind of characterization leads to 
a good local characterization it is limited by the fact that no localization of the 
tunnels is provided by the use the Euler characteristic (see Figure 3). Another 
definition has been proposed by Kong & al. in [KR89] which is based on re- 
marks made by Morgenthaler in [MorSI], with a new formalism which inlvolves 
the digital fundamental group introduced by Kong in [Kon89] . In this latter def- 
inition, topology preservation is expressed in term of existence of a canonical 
isomorphism between the fundamental group of the object and the fundamental 
group of the object without the considered point to be removed, and a similar 
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isomorphism must exist in the background of the image (see Definition 3). In 
this paper, we prove that this second condition is in fact implied by the first one. 
In other words, we show that preserving the tunnels of the objects will imply the 
preservation of tunnels in the background. In order to prove that a such more 
concise characterization can be given, we use the linking number between paths 
of voxels as defined in [FMOO] which provides an efficient way to prove that a 
given path cannot be homotopic to a degenerated path. Furtermore, the proof 
of Theorem 2 given in Section 3 uses propositions an lemmas, some of which are 
the direct answers to some open questions left by Morgent haler in [Mor81] such 
as: do any two paths which can be be continuously deformed one into each other 
in an object X keep this property after removal of a simple point of X ? 

2 Definitions 

2.1 Digital Image, Paths, Connectivity 

In this paper, we consider objects as subsets of the 3 dimensional space I?. 
Elements of 1? are called voxels (short for “volume elements”). The set of voxels 
which do not belong to an object O C constitute the complement of the 
object and is denoted by O. Any voxel is identified with a unit cube centered 
at a point with integer coordinates v = (z, j, k) ^ I? . Now, we can define some 
binary symmetric and anti-refiexive relations between voxels. Two voxels are 
said Q— adjacent if they share a face, IS— adjacent if they share an edge and 
26— adjacent if they share a vertex. By transitive closure of these adjacency 
relations, we can define another one: connectivity between voxels. We first define 
an n-path tt with a length I from a voxel a to a voxel 6 in O C as a sequence 
of voxels {yi)i=o...i such that for 0 < z < / the voxel yi is n-adjacent or equal 
to yi-\-i, with yo = a and yi = b. The path tt is a closed path if = yi and 
is called a simple path if y^ ^ yj when i ^ j (except for and yi if the path 
is closed). The voxels and yi are called the extremities of tt even in the case 
when the path is closed, and we denote by tt* the set of voxels of tt. A closed 
path (x, x) with a length 1 for x G I? is called a trivial path. If x is a voxel of 
Z^ and n G {6, 18, 26} then we denote by Nn{x) the set of voxels of I? which 
are n— adjacent to x. We call Nn{x) the n— neighborhood of x. A subset C of 
1? is called a simple closed n— curve if it is n— connected and any voxel of C is 
n— adjacent to exactly two other voxels of C; then, if c is a simple closed n— path 
such that c* = C, c is called a parameterized simple closed n— curve. 

Given a path tt = {yk)k=o,... we denote by 7t“^ the sequence {y'j^)k=o,... ,i 
such that yk = y[-k for A; G {0, . . . , /}. 

Now we can define the connectivity relation: two voxels a and b are called 
n- connected in an object O if there exists an n-path tt from a to 6 in O. This is 
an equivalence relation between voxels of O, and the n— connected components 
of an object O are equivalence classes of voxels according to this relation. 

In order to avoid topological paradoxes, we always study the topology of an 
object using an n— adjacency for the object and a complementary adjacency n for 
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its complement. We sum up this by the use of a couple (n, n) in {( 6 , 26), ( 6 +, 18), 
(18, 6 +), (26, 6 )}. The notation 6 + is used in order to distinguish the relation of 
6 — connectivity associated to the 26— connectivity from the ( 6 +)— connectivity 
associated to the 18— connectivity. 

If 7 T = {yi)i=o^... and tt' = (^^)/.=o,... ,p' are two n— paths such that ijp = 
then we denote by tt.tt' the path (^o, • • • , yp-ijy'o^ • • • ly'p') which is the concate- 
nation of the two paths tt and tt'. 



2.2 Geodesic Neighborhoods and Topological Numbers 

The geodesic neighborhoods have been introduced by Bertrand ([Ber94]) in order 
to locally characterize in an efficient way the simple points of an object. 

Definition 1 (geodesic neighborhood [Ber94]). Let x ^ X , we define the 
geodesic neighborhood of x in X denoted by Gn{x^X) as follows: 

-Gq{x,X) = {NQ{x)nX)U{y G Xi8(x)| y is6—adjaeent to a voxel of NQ{x)nX}. 

-G 26 {x^X) = N 26 {x)nX. 

Definition 2 (topological numbers [Ber94]). Let X C and x ^ I? . We 

define the topological number assoeiated to x and X, and we denote by Tn{x, X) 
for (n,n) G {(6,26), (26,6)}, the number of eonneeted eomponents of Gn{x, X). 



2.3 Digital Fundamental Group 

In this section, we define the digital fundamental group of a subset X of Z^ 
following the definition of Kong in [Kon89] and [Kon 88 ] . 

First, we need to introduce the n—homotopy relation between n— paths in X. 
Intuitively, a path c is homotopic to a path c' if c can be “continuously deformed” 
into d . Let us consider X C Z^. First, we introduce the notion of an elementary 
n— deformation. Two closed n— paths c and c' in X having the same extremities 
are the same up to an elementary n— deformation (with fixed extremities) in X, 
and we denote c c', if they are of the form c = 'K\.'^.'K 2 and d = 7 ri. 7 '. 7 T 2 , 
the n— paths 7 and 7 ' having the same extremities and being both included in 
a 2 X 2 X 2 cube if (n,n) = (26,6), in a 2 x 2 square if (n,n) = (6,26). Now, 
the two n— paths c and d are said to be n—homotopie (with fixed extremities) 
in X if there exists a finite sequence of n— paths c = cq, . . . , = d such that 

for z = 0 , . . . , m — 1 the n— paths q and are the same up to an elementary 
n— deformation (with fixed extremities). In this case, we denote c d . A closed 
n— path c = (xo, ... , = xq) in X is said to be n— reducible in X if c (xq, xq) 

in X. 

Let 5 G X be a fixed voxel of X called the base surf el. We denote by A^{X) 
the set of all closed n— paths c = (xq,... ,x^) which are included in X and 
such that Xq = Xp = B. The n—homotopy relation is an equivalence relation 
on A^(X), and we denote by 77]^ (X, B) the set of the equivalence classes of this 
equivalence relation. If c G A^(X), we denote by [c]]j^(^x,b) Ihe equivalence class 
of c under this relation. 
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The concatenation of closed n— paths is compatible with the n— homotopy 
relation, hence it defines an operation on 71]^ (X, B), which to the class of ci and 
the class of C 2 associates the class of C 1 .C 2 . This operation provides Ui{X^B) 
with a group structure. We call this group the n— fundamental group of X. The 
n— fundamental group defined using a voxel B' as base surfel is isomorphic to the 
n— fundamental group defined using a voxel B as base surfel if X is n— connected. 

Now, we consider Y d X dl? and 5 G T a base voxel. A closed n— path in Y 
is a particular case of a closed n— path in X. Furthermore, if two closed n— paths 
of Y are n— homotopic (with fixed extremities) in T , they are n— homotopic (with 
fixed extremities) in X. These two properties enable us to define a canonical 
morphism A • B) — > 77]^ (X, 5), which we call the morphism induced by 

the inclusion map i : Y — > X. To the class of a closed n— path a G A^{Y) in 
TIf (y, B) the morphism A associates the class of the same n— path in 77f (X, B). 

2.4 The Digital Linking Number 

The digital linking number, denoted by has been defined in [FMOO] for a 
couple (tt, c) of closed paths of which do not intersect each other. It is the 
digital analogue of the linking number defined in knot theory (see [Rol]) and 
it is immediately computable (see [FMOO]) for any couple (tt, c) such that tt is 
an n— path and c is an n— path with (n, n) G {(6, 26), (26, 6), (6+, 18), (18, 6+)}. 
This number counts the number of times two digital closed paths are interlaced 
one with each other, as illustrated in Figure 1. Since the precise definition of 
the linking number is too long to be recalled here, we simply give the idea of 
the way it can be computed in Figure 2. In this figure, we have depicted several 
configurations which illustrate the contributions of superposed voxels, one in 
each paths, in a 2D projection of the two paths. The digital linking number is 
nothing but the sum of such contributions for all couples of overlapping voxels. 




(a) L' 7 t,c — dzl. 




(b) = ±2. 




(c) The Whitehead’s link: 

L/7T,C 0. 



Fig. 1. Three kinds of links: a 6— path tt in black and a 18— path c in white 



The two following theorems have been proved in [FMOO] and allows to say 
that the linking number is a new topological invariant in the field of digital 
topology. 
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Fig. 2. Contributions associated with voxels where the two paths of a link overlap 
in a 2D projection of a link 



Theorem 1. Let tt and n' be two elosed n—path and c be a elosed n—path ofl? 
sueh that tt* fl c* = 0 and tt'* H c* = 0. //tt tt' in 1? \ c* then = Ltt'.c 
aTi/d L/ Q — d-j Q • 

Remark If c is a trivial path, then Lc, 7 t = 0 for any closed n—path such 
that c* n 7T* = 0. It follows that if a closed n—path c in X C is n— reducible 
in X, then Lc, 7 t = 0 for all closed n—path tt in c*. 

2.5 Characterization of Simple Points 

A simple point for X C is a point the deletion of which does not change the 
topology of X. Now, topology preservation in 3D is not as simple to express as in 
the 2D case because of the existence of tunnels. A few authors have used two main 
tools to study topology preservation: the Euler characteristic which only allows 
to count the number of tunnels of an object ([TF82]), and the digital fundamental 
group ([Kon89]) which allows to “localize” the tunnels. Indeed, as depicted in 
Figure 3, counting the number of tunnels is not sufficient to characterize the fact 
that the topology is preserved. In this paper, we are interested by a definition 
of simple points which uses the digital fundamental group and which avoids 
the problem previously mentioned. The following definition appears as the most 
convenient for the property “the deletion of x preserves topology of X” . It comes 
from the criterion given in [Kon89] for a thinning algorithm to preserve topology. 

Definition 3. Let X dl? and x G X. The point x is said to be n— simple if: 

i) X and X \ {x} have the same number of n— eonneeted eomponents. 
a) X and X U {x} have the same number of n— eonneeted eomponents. 

Hi) For eaeh voxel B in X \ {x}, the group morphism i^ : 77]^ (X \ {x}, B) — > 
77f(X, 5) indueed by the inelusion map i : X \ {x} — > X is an isomor- 
phism. 
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Fig. 3. The gray point can be removed without changing the Euler characteristic 
of the object which is equal to zero. However, this point is obviously not n— simple 
(if (n,n) e {(6, 18), (6,26)}) 



iv) For each voxel B' in X, the group morphism i'^ : 77f (X, B')X — > TIf (X U 
{x}, B') induced by the inclusion map i' : X — > XU{x} is an isomorphism. 

Bertrand, in [BM94], gave a local characterization for 3D simple points in 
term of number of connected components in geodesic neighborhoods. However, 
the definition of simple point given in [BM94] differs from the definition used here 
since it does not consider any morphism between digital fundamental groups but 
only require the preservation of cavities and “tunnels” . One purpose of this paper 
is to well formalize the fact that the local characterization given by Bertrand is 
a consequence of the three first conditions of Definition 3 and conversly that the 
four conditions are themselves consequences of the local characterization using 
the topological numbers. We recall here the characterization given by Bertrand 
and Malandain in [BM94]. Note that the definition of simple points used in this 
proposition slightly differs from Dehnition 3. 

Proposition 1 ([BM94]). LetxeX and (n, n) G {(6, 26), (26, 6)}. The point x 
is a n— simple point if and only ifTn{x,X) = 1 and Tn{x,X) = 1. 

3 A New Characterization of 3D Simple Points 

In the sequel of this paper (n, n) G {(6, 26), (26, 6)}. 

In this section, we state the main result of this paper which is that a not 
less restrictive criterion for topology preservation is obtained using the only 
conditions z), ii) and Hi) of Definition 3. In other words, we prove the following 
theorem: 

Theorem 2. Let X d 1? and x G X. The point x is n— simple for X if and 
only if: 

i) X and X \ {x} have the same number of connected components. 

ii) X and X U {x} have the same number of connected components. 

Hi) For each voxel B in X\ {x}, the group morphism i^ : Uf{X \ {x}, B) — > 
77f(X, H) induced by the inclusion map i : X \ {x} — > X is an isomor- 
phism. 
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In order to prove this theorem, we first prove (Subsection 3.1) that a voxel 
which satisfies the three conditions of Theorem 2 also satisfies the the local 
characterization given by Proposition 1 and then, we show (Subsection 3.2) that 
this characterization itself implies that the four conditions of Definition 3 are 
satisfied. 

In the sequel of the paper, we may suppose without loss of generality that X 
is an n— connected subset of and that x and B are two distinct voxels of X 
whereas B' is a voxel of X. Furthermore, z* : TIf (X\{x}, B) — > TIf (X, B) is the 
group morphism induced by the inclusion of X\{x} in X; and i'^ : 77]^ (X, B') — > 
77]^(XU{x}, B') is the group morphism induced by the inclusion of X in XU{x}. 

Remark 2. In the sequel, we shall admit the basic property that, if Y" C X are 
n— connected subsets of Z^, the group morphism from 77]^ (Y, 5) to 77{^(X, 5) 
induced by the inclusion of Y in X for a base surfel 5 c Y is an isomorphism 
if and only if the group morphism between 77{^(Y, 5') and 77]^ (X, 5') is an 
isomorphism for any base surfel B' 

3.1 First Step of the Proof of Theorem 2 

The purpose of this section is to prove the following proposition. 

Proposition 2. If the eonditions i), ii) and Hi) of Definition 3 are satisfied, 
then Tn{x,X) = 1 and Tn{x,X) = 1. 

In order to prove this proposition, we introduce several other propositions 
and lemmas. 

Proposition 3. If Tn{x, X) > 2, then either an n—eonneeted eomponent of X 
is ereated by deletion of x, or the morphism i^ is not onto. 

Sketch of proof of Proposition 3: The proof of a similar proposition given 
in [BM94] may be adapted to our formalism. It involves the definition of a 
number u{x, a, C) which counts the number of time a given n— path a goes from 
a component C oi X) to x minus the number of times a goes from x to C 

(see Figure 4(a)). This number is shown to be invariant under n— homotopic 
deformations of the n— path a inside X. Then, it allows to prove that, when 
connected components of Gn{x,X) are n— connected in X \ {x}, an homotopy 
class of paths distinct from the class of the trivial path, cannot be reached by 
the morphism i^ (class of the path a of Figure 4(a)). □ 

Here, we state the following new proposition. 

Proposition 4. If Tn{x, X) = 1 and Tn{x,X) > 2 then two n—eonneeted eom- 
ponent of X are merged by deletion of x or the morphism i^ is not one to one. 

The main idea of this paper is to use the linking number in order to prove 
Proposition 4. Indeed, until this paper and the possible use of the linking number, 
one could prove that when T^(x, X) = 1 and Tn(x, X) > 2 and no n— connected 
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Fig. 4. Parts of the proofs of Propositions 3 Fig. 5. Idea of the proof 
and 5 of Proposition 4 



component of X are merged by deletion of x, then, the morphism ij induced by 
the inclusion of X in X U {x} is not onto. In other words, “a hole is created in 
X U {x}” . Indeed, a similar proof to Proposition 3 would lead to the following 
proposition (see Figure 4(b)). 

Proposition 5. If Tn{x, X) = 1 and Tn{x^X) > 2 then two n— connected com- 
ponent of X are merged by deletion of x or the morphism i'^ is not onto. 

In this paper, we show that in this case “a hole is created in X \ {x}” or 
more formally, i^ is not one to one. This is proved using the linking number 
as illustrated in Figure 5 where the closed path c is reducible in X (Lemma 2 
below) whereas it is not reducible in X \ {x} since Lc^p = ±1 (Remark 1). This 
shows that a condition on the preservation of tunnels in the object (Condition 
Hi of Definition 3) is sufficient to ensure that tunnels of the complement are also 
left unchanged. And the proof of this result is obtained with the only formalism 
provided by the use of the digital fundamental group. 

Before proving Proposition 4, we must state the two following lemmas. 

Lemma 1. Let x G X such that Tn{x^X) = 1 and Tn{x,X) > 2 and let A = 
Gn(x,X). Then there exists a parameterized simple closed n— curve c in A (see 
Figure 5) and a closed n—path f3 in N 2 e{x) H X such that Lc,p = ±1. 

The latter lemma has been proved by investigating, using a computer, all 
the possible local configurations in the neighborhood of a voxel x such that 
Tn(x,X) = 1 and T^(x,X) > 2 ; showing for each one the existence of a simple 
closed curve in Gn{x,X). The existence of the n—path (3 of Lemma 1 is then 
proved using local considerations. 

Lemma 2. Let x he a point of X such that Tn{x,X) = 1. Then, any closed 
n—path c in Gn{x,X) is n— reducible in X. 
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Sketch of proof of Proposition 4: Let x be a point of X such that T^(x, X) = 
1 and Tn{x^ X) >2. Let c and [3 be the paths of Lemma 1; and let a and b be the 
two voxels of f3 which are n— adjacent to x. If a and b are not n— connected in X 
then it is clear that two n— connected components of X are merged by deletion 
of X from X. 

If a and b are connected by an n— path a' in X. Then, it follows that f3 
is n— homotopic to t he path a = (a). o'. (6, x, a) in {N 2 e{x) H X). Now, from 
Theorem 1 and since (N 26 {x) H X) C c* we have Lc,p = Lc^a = ±1. 

From Theorem 1 (and Remark 1), it follows that the path c is not n— reducible 
in o'* and since o* C X U {x} then X \ {x} C o* so that a fortiori a cannot be 
n— reducible in X \ {x}. Formally, if c is a closed path from a voxel A to A in X \ 
{x}, we have [c]nn(x\{x},A) 7^ [l] 77 f(x\{a:},A)- Now, from Lemma 2, c {A, A) 
in X so that **([c]/7n(x\{a:},yl)) = [c]n"(X,A) = [l]77{‘(v:,yl) = **([l]77{‘(X\{a:},A))) 
i.e. z* is not one to one. □ 

Proof of Proposition 2: Suppose that properties i), ii) and Hi) of Definition 3 
are satisfied. From Proposition 3 we deduce that Tn(x, X) < 2. Furthermore, if no 
connected component of X is removed then Tn(x, X) ^ 0. Finally, Tn{x,X) = 1. 

From Proposition 4 we deduce that Tt^(x,X) < 2. Then, if no connected 
component of X is created then Tt^(x,X) ^ 0. Finally, Tt^(x,X) = 1. □ 



3.2 Second Step of the Proof of Theorem 2 

We prove the following proposition. 

Proposition 6. //Tn(x,X) = 1 and Tn{x,X) = 1, then conditions i), ii), Hi) 
and iv) of Definition 3 are satified. 

The main difficult part of the proof of Proposition 6 relies on the following 
proposition. 

Proposition 7. If Tn{x, X) = 1 and Tn{x,X) = 1 then i^ is an isomorphism. 

Corollary 1. IfTn{x,X) = 1 and Tn{x,X) = 1 then i'^ is an isomorphism. 

In order to prove Proposition 7 we first state Lemma 3 and Lemma 4. 

Lemma 3. If Tn{x, X) = 1 and Tn(x,X) = 1 then for all B e X \ {x} and all 
n—path c of A^{X), there exists a path c' in A^(X\{x}) such that c c' in X. 

Lemma 4. IfTn{x,X) = 1 and Tn{x,X) = 1 then, for any voxel B e X \ {x}, 
two closed n— paths c and c' of A^{X \ {x}) which are n— homotopic in X are 
n— homotopic in X \ {x}. 

Sketch of proof of Proposition 7 : Lemma 3 allows to prove that i^ is onto, 
and Lemma 4 allows to prove that i^ is one to one. □ 

Proof of Proposition 6: It is readily seen that Tn{x,X) = Tn{x,X) = 1 im- 
plies conditions i) and ii) of Definition 3. Then, from Proposition 7 and Corol- 
lary 1, we have Tn{x,X) = 1 and Tn{x,X) = 1 ^ Hi) and iv). □ 
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3.3 End of the Proof of Theorem 2 

Proof of Theorem 2: Following Definition 3, a simple voxel obviously satisfies 
the three conditions of Theorem 2. Now, from Proposition 2, a voxel which 
satisfies the three conditions of Theorem 2 is such that Tn{x,X) = Tn{x, X) = 1. 
Finally, from Proposition 6, if Tn{x^X) = Tn{x^X) = 1 then x satisfies the four 
conditions of Definition 3. □ 



4 Conclusion 

The digital linking number allows us to formalize in a comprehensive way the 
characterization of 3D simple points for the complementary adjacency couples 
(6,26) and (26,6). The new theorem which is proved here shows the usefulness 
of the linking number in order to prove new theorems which involve the digi- 
tal fundamental group in I?. Now, even if the linking number is well defined 
for (n,n) G {(6+, 18), (18,6-b)}, it has not been used yet to provide a charac- 
terization of 3D simple points, similar to Theorem 2, for the latter couples of 
adjacency relations. This, because an open question remains about the existence 
of a simple closed curve, analogue to the curves c of Lemma 1, in this case. 
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Abstract. We introduce the notion of digital n-pseudomanifold and dig- 
ital n-weakmanifold in (n+l)-digital image, in the context of (2n, 3^ — 1)- 
adjacency, and prove the digital version of the Jordan-Brouwer separa- 
tion theorem for those classes. To accomplish this objective, we construct 
a polyhedral representation of the n-digital image, based on cubical com- 
plex decomposition. This enables us to translate some results from poly- 
hedral topology into the digital space. Our main result extends the class 
of “thin” objects that are defined locally and verifies the Jordan-Brouwer 
separation theorem. 

Keywords: digital topology, combinatorial topology, discrete spaces, 
combinatorial manifolds 



1 Introduction 

A binary (n + l)-digital image is an ordered uplet V = 7^, iT), where H 

is a finite subset of and 1Z represents the adjacency relation in the whole 

lattice. 

Morgenthaler and Rosenfeld in [16] introduced for a binary 3-digital image the 
concept of a simple surface point, where 7^ G {(6, 26), (26, 6)}, to characterize 
a class of objects H that verifies the digital version of the Jordan-Brouwer sepa- 
ration theorem. They define a simple surface point axiomatically by three local 
conditions, these conditions reflect basic properties of triangulated simple and 
closed surface’s vertices in They show that any finite and connected subset 
of consisting entirely of simple surfaces points verifies the digital version of 
the Jordan-Brouwer separation theorem. Thus, simple surface points criterion 
determines a class of thin object called digital simple surface of Z^. 

Kong and Roscoe in [11] reveal precisely the geometric meanning of the sur- 
face simple points and extend its properties to other classes of 7^-adjacency, i.e. 
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IZ G {(18, 6), (6, 18), (26, 18), (18, 26)}. They introduce the notion of continuous 
analog for transfering statements from continuous topology to digital space, and 
show that a connected finite subset H of I? is a digital simple surface if and only 
if the continuous analogs of is a simple and closed polyhedral surface of 
Note that [14] extends the concept of continuous analogs to the class of strongly 
normal 3D digital image. 

In [6] Bertrand and Malgouyres defined strong surfaces and in [7] Couprie and 
Bertrand introduced simplicity surfaces. Those classes of surfaces are more richer 
than Morgenthaler-Rosenfeld surfaces because they contain more local configu- 
rations and discrete analogs for these classes of surfaces can be built (see [5] for 
strong surfaces). 

In [4] Ayala, Dominguez, Frances and Quintero prove that any digital n-manifold 
in verifies the Jordan-Brouwer separation theorem (the digital n-manifold 
is the generalization of the notion of digital simple surface for n > 2). 
Malgouyres in [15] investigated the reverse problem in Z^, in the context of the 
(26,6)-adjacency relation. He shows that there is not a local characterization of 
subset of I? which separates I? in exactly two 6-connected components. 

It is not difficult to realize that there is still a large margin between the local char- 
acterization of digital simple surface and the global characterization represented 
by the digital version of the Jordan-Brouwer separation theorem. Intuitively, the 
problem is that there exists polyhedra with singularity which verify the Jordan- 
Brouwer separation theorem. In this context, the notion of n-pseudomanifold is 
very interesting, it is a formulation of weak singularities corresponding to dege- 
narate n-manifold (see property 4 in remark 1). 

This work is a first attempt to bring closer local and global properties that define 
the class of object verifying the digital version of the Jordan-Brouwer separation 
theorem. Our approach is based on combinatorial topology and uses topological 
properties in addition to local neighborhood structures. Definitions and state- 
ments of n-manifold and n-pseudomanifold are given in section 2. Section 3 is 
devoted to review basic of digital topology in . In section 4, we give a sim- 
ple method to construct the continuous analogs of binary (n -h l)-digital image 
based on cubical complex decomposition. This enables us to translate results 
from polyhedral topology into digital topology. Section 5 is aimed to state our 
main results. 

2 n-Manifold and n-Pseudomanifold 

In this section we review some definitions of algebraic topology. 

Definition 1. (see [1], pp. 42 ) 

A triangulation of a space X is a pair {K.cp), where K is a simplicial complex 
and (j) \ \K\ ^ X is a homeomorphism. The complex K is said to triangulate X. 
A polyhedron is any space which admits a triangulation. 

Definition 2. (see [8], pp.20) 

Let K he an homogeneous n-dimensional simplicial complex. A combinatorial n- 
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sphere is a polyhedron \K\ sueh that eertain subdivision of K is simplieially 
homeomorphie to a subdivision of the boundary of the {n + 1) -simplex. 

Let K he 8i simplicial complex and e G K, the link of e in denoted by Lk(e; K)^ 
is a subcomplex of K such that a G Lk{e; K) if and only if 

1. e is not a face of cr, and 

2. 3 cr' G K such that 

2.1. cr is a face of cr', and 

2.2. e is a face of cr'. 

Definition 3. (see [8], pp. 26) 

A eonneeted polyhedron X is said to be an n-manifold without boundary if it 
admits a triangulation (K, (p) satisfying: 

1. K is homogeneously n- dimensional. 

2. for all a G K, Lk{a; K) is a eombinatorial {n — dimmer) — l)-sphere. 

K is ealled a eombinatorial n-manifold without boundary. 

Definition 4. (see [1], pp. 195) 

A eonneeted polyhedron X is said to be an n -pseudomanifold without boundary 
if it admits a triangulation (K, (f) sueh that: 

1. K is homogeneously n- dimensional. 

2. Every {n — l)-simplex of K is a faee of preeisely two n-simpliees of K. 

3. If cr and cr' are two distinets n-simpliees of K , then there exists a sequenee 
cTi, . . . , cr^ of n-simpliees in K sueh that ai = cr, (Ju = cr' cind cri meets cr^+i 
in an {n — 1) -dimensional faee for 1 < i < u. 

K is ealled a eombinatorial n -pseudomanifold without boundary. 

Remark 1. (see [1]) 

1. An n-sphere, denoted by is an n-manifold without boundary. 

2. Every n-manifold without boundary is an n-pseudomanifold without bound- 
ary. 

3. Let X be an n-pseudomanifold and let (L,'0) be any triangulation of X, 
then L is a combinatorial n-pseudomanifold without boundary. 

4. An octahedron is a 2-manifold without boundary. If we identify two opposite 
vertices, we obtain a 2-dimensional curved polyhedron, whose triangulations 
are examples of 2-pseudomanifold without boundary which are not closed 2- 
manifolds without boundary. 

Theorem 1. (see [3], pp. 94) 

Every n-pseudomanifold without boundary in divides into two do- 

mains ^ and is the eommon boundary of both domains. 

This theorem obviously remains true if is substitued for it is sufficient 

to map onto by means of a stereographic projection with center of 

projection lying outside the given pseudomanifold, see [3] page 94. 

Theorem 2. Every n-pseudomanifold without boundary in divides 

into two domains and is the eommon boundary of both domains. 



^ A domain of is an open and connected subset of 
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3 General Definitions 

In digital image, the study of neighborhood is a very important and significant 
concept. For a given lattice point, a neighborhood of a point is defined typically 
using a metric distance. Let p, q ^ with the coordinates ip))t^ and 

respectively. We will consider two types of distance between elements 

in Z”+V 

n+1 

di{p,q) = doo{p,q)= Max \xi{p) - Xi{q)\. 

i=l 2=1... n+1 

Let [3 G { 1 , oo}; two points p,q ^ are said to be (i/ 3 -adjacent if q) = 1; 
we denote by V/ 3 (p) the set of all d/ 3 -neighbors of p. Consequently we have: 
i) Vi(p) = {q ^ / di{p^q) = 1 } and Card(Vi(p)) = 2 (n + 1 ). 

ii) Voo(p) = {q^ / doo{p,q) = 1} and Card(Voo(p)) = 3"^+^ - 1 . 

In the literature ( [13], [14] and [17] ) the di-neighbor is referred as (2(n + 1))- 
neighbor and the doo-neighbor as (3^^+^ — l)-neighbor. 

Let T C we denote by the complement of T in Z^^+^; = 

Z^+^ - T. 

A binary (n+l)-digital image is an ordered uplet V = (Z^^+^, 77., 77), where H 
is a finite subset of and IZ represents the adjacency relation in the whole 

lattice. In this paper the adjacency relation IZ will be taken as follows : 

two elements of H are said to be 77.-adjacent if they are -adjacent, two 
elements of are said to be 77.-adjacent if they are 7oo- adjacent, and an 
element in H is 77.-adjacent to an element in if they are doo-adjacent. 

Let T C Z^^+^ and p G Z^^+^ , p is said to be 77.-adjacent to T if p is 77.-adjacent 
to some point in T. T is said to be strongly thin if and only if any element 
of T is 77.- adjacent to all 77.-components of T^. It is said to be separating if and 
only if has exactly two 77.-components. 

If T C Z^^+^ and p,q ^ T then an 77.-path from p to in T is a sequence of 
distincts points (pi, • • • ,Pm) in T such that pi = p and Pm = q and pi is IZ- 
adjacent to Pi+i, 1 < i < m. T is 77.-connected if given any two elements p 
and g in T there is an 77.-path in T from p to < 7 ; an 77.-component of T is a 
maximal 77.-connected subset of T. 



4 Polyhedral Representation of a Binary (n+l)-Digital 
Image 

Let li denote the open unit interval ]ri,r^+i[ or the single point for some 

n+1 

integer r^. A 7-cube Ck is defined as e/c = 71 Ii where k of the /^’s are intervals 

2 = 1 

and n -h 1 — 7 are single points. Thus Ck is an open 7-cube embedded in 

n+1 

The closure of e/c, will be denoted by Note that = II Ii. We denote by 

2=1 

Som{ek) the set of all vertices of Obviously we have Som{ek) C Z^^+^. The 



Digital n-Pseudomanifold and n-Weakmanifold 



41 



subscript of a k-cnhe will be omitted when irrelevant to the argument at hand. 
A face of a k-cube e is a k'-cube e' such that Som{e') C Som{e). We will write 

e' :< e. 

A cubical complex AT, is a finite collection of k-cubes in some 0 < k < 

n + 1, such that: 

^ if e G AT, then all faces of e belong to AT, and 
^ if e, e' G AT, then e H e' = 0. 

Now, we will construct the continuous analog of a binary (n + l)-digital image 
p = 7^, AT). Intuitively, this construction consists of ’filling in the gaps’ 

between black points of P, and must be consistent with the 7^- adjacency relation 
of V. Precisely, let C{H) be a collection of A-cubes in 0 < k < n+1, defined 

as follows: 

C(AT) = {e : k — cube, 0 < k < n 1 / Som{e) C H} 

C{H) is a cubical complex, the underlying polyhedron |C(AT)| will be called 

the polyhedral representation of V. 

For each x G e{x) will denote the A-cube, 0 < /c < n + 1, that contains x. 

From the construction of |C(iA)|, we can deduce some natural properties: 

Remark 2. Let V = ,1Z, H) . We have : 

1. if X G |C(iA)|, then e{x) G C(7A). 

2. if X G — |C(7A)|, then at least one vertex of e{x) belongs to 

3. each component of |C(iA)| or of — |C(7A)| meets 

The following theorem expresses the fundamental properties that permit us 
to relate digital topology to Euclidean space topology. 

Theorem 3. Let V = H) . 

1. |C(iA)| n Z^+i = H and (R^+i - |C(7A)|) n Z^+^ = iAZ 

2. Two points in H are in the same IZ-eomponent of H iff they are in the same 
eomponent of\C{H)\. 

3. Two points in are in the same IZ-eomponent of iff they are in the 
same eomponent ofW^~^^ — |C(7A)|. 

The boundary of a eomponent A of |C(7A)| meets the boundary of a eompo- 
nent B ofMT~^^ — |C(iA)| if and only if there is a point in An Z^+^ that is 
IZ-adjaeent to a point in B n . 

The proof of the above theorem and other properties related to the concept of 
continuous analogs in a binary (n + l)-digital image are given in [10]. 

5 Digital n-Weakmanifold and Digital n-Pseudomanifold 

We denote by AT a cubical complex such that \K\ is a connected polyhedron. 
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Definition 5. n-weakmanifold 

K will be called a cubical n-weakmanifold without boundary if and only if 

1. K is homogeneously n- dimensional. 

2. for each vertex p G K, Lk{p;K) is a combinatorial (n — l)-sphere. 

The underlying polyhedron \K\ is called an n-weakmanifold without boundary. 

The definition of the n-weakmanifold is a weak formulation of n-manifold given 
in definition 3, it uses uniquely the property of link in vertices of K. For 1 < n < 2 
those two notions are equivalent. 

Definition 6. n -pseudomanifold 

K will be called a cubical n -pseudomanifold without boundary if and only if 

1. K is homogeneously n- dimensional. 

2. Every (n — l)-cube of K is a face of exactly two n-cubes of K. 

3. If e and e' are two distincts n-cubes of K, then there exists a sequence 
ei,... of n-cubes in K such that e\ = e, Cu = e' and Ci meets 

in an (n — l)-cube for 1 < i < u. 

The underlying polyhedron \K\ is called an n -pseudomanifold without boundary. 
Remark 3. It is not difficult to see that 

1. If 7T is a cubical n-pseudomanifold without boundary then the barycentric 
subdivision of iT is a combinatorial n-pseudomanifold without boundary. 

2. Let p be a vertex of K. If Lk(p; K) is a combinatorial (n — l)-sphere, then 
Lk(p; K) is a cubical (n — 1) -pseudomanifold without boundary. 

Let (Ji be a /c-cube of K and p be a vertex of K such that p ^ Som{(Ji). 
We denote by p.cii the {k + l)-cube, if it exists, such that ({p} U Som{(Ji)) C 
Som{p.(Ji). 

Let (j and (Ji be respectively a (A; -h l)-cube and a k-cube of K such that ai -< a. 
If p G Som{a) — Som{a\) then p.cri = a. 

Proposition 1. Any cubical n-weakmanifold without boundary K is a cubical n- 
pseudomanifold without boundary. 

Proof. We have to prove properties 2 and 3 in the definition 6. 

^ Let a be an (n — l)-cube of K, we will show that cr is a face of exactly 
two n-cubes of K. 

Let p G 5om(cr), and a' an (n — 2)-cube of Lk(p, K) such that a' -< a. It is easy 
to see that p.a' = a. Lk(p, K) is a combinatorial (n — l)-sphere, by using part 2 
of remark 2, we deduce that a' is a face of exactly two (n — l)-cubes (cri, a 2 ) of 
Lk(p, K). This implies that p.a' = cr is a face of exactly two n-cubes (p.cri, p.a 2 ) 
of K, otherwise a would be a face for more than two (n — l)-cubes of Lk(p, K). 

^ Let cr and a' be two n-cubes of iT, we will show that there exists a sequence 
ei, . . . , of n-cubes in K such that ei = a^ Cu = a' and meets in an 
(n — l)-cube for 1 < z < n. 

Let p G Som{a) and p' G Som{a'). Since \K\ is a connected polyhedron, there 
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is a sequence (po, • • • iPk) of vertices of K that joins p = po to p' = pkj i.e, 

k-l 

,U bi>Pi+i] ^ l-^l- For h 0 < i < k, Pi-i and belong to Lk(p^, K). 

2=0 

Moreover, any two (n — l)-cubes and e' of Lk(p^,i^) can be joined by a 
sequence of (n — l)-cubes e^, . . . , in Lk(p^, K) such that consecutive (n — 1)- 
cubes of this sequence share a common (n — 2)-cube. So, any two n-cubes of K 
having pi as a vertex can be joined by a sequence of n-cubes in K such that 
consecutive n-cubes of this sequence share a common (n — l)-cube. 

Let (Jn,pi denotes an n-cube of K such that pi and belong to Som{an,pi)- 
Thus, and can be joined by a sequence of n-cubes in K such that 

consecutive n-cubes of this sequence share a common (n — l)-cube, for 0 < i < k. 
It is the same thing for the pair (cr, cTn,po) ^^d {an,p,^^cr'). So there exists a 
sequence of n-cubes in K that joins a to a' such that any consecutive n-cubes 
of this sequence share a common (n — l)-cube. 

This completes the proof. □ 

Definition 7. Let V = 7^, TT), and H he IZ-eonneeted. 

1. H will be ealled digital n-weakmanifold if C{H) is a euhieal 
n-weakmanifold without boundary. 

2. H will be ealled digital n-pseudomanifold if C{H) is a euhieal 
n -pseudomanifold without boundary. 

Proposition 2. LetV = 

1. If H is a digital n-weakmanifold, then H is a digital n-pseudomanifold. 

2. If H is a digital n-pseudomanifold, then H is a separating set (see seetion 3 
for the definition of separating set). 

3. If H is a digital n-pseudomanifold, then H is strongly thin. 

Proof. 

1. Let iL be a digital n-weakmanifold. 

C{H) is a cubical n-weakmanifold without boundary. By using proposition 1, we 
deduce that C{H) is a cubical n-pseudomanifold without boundary. This implies 
that iL is a digital n-pseudomanifold. 

2. Let iL be a digital n-pseudomanifold. 

C{H) is a cubical n-pseudomanifold without boundary. By using the part 1 
of the remark 3 and part 3 of the remark 1, we deduce that |C(iL)| is an n- 
pseudomanifold without boundary. So, the theorem 2 enables us to assert that 
|C(iL)| divides in two domains {Int, Ext) and is the common boundary 

of both domains. By using Theorem 3, we deduce that has exactly two IZ- 
components ( IntnJP^^ denoted by Int{H), and ExtnZ'^~^^ denoted by Ext{H) 
). Furthermore dint = |C(iL)| = dExt. 

3. Let p G H. Since dint = |C(iL)| and p G |C(iL)|, we can assert that : 

Ve > 0, ®(p, e) ^ 0, with ®(p, e) = {x G / doo(p, x) < e}. 

Let e = ^. 3x ^ Int / doo{p^x) < 

Let e be an (n + l)-cube such that p G Som{e) and x G e. 

e{x) ^ C(iL), otherwise x G |C(iL)| (for the definition of e{x) see section 4). So, 
3pi G Som{e{x)) / pi ^ H. Note that e{x) ^ e. 
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[x,pi] C e(x). Since pi G and x G Int, then pi G Int {Int is a connected 
component of — |C(i7)|). Int fl 17^^^ = so pi G 

Since e{x) 7 e, then p and p' are vertices of e. This implies that p is <ioo -adjacent 
to p', i.e, p is 7^-adjacent to 

In the same way, we prove that p is 7^-adjacent to Ext{H). 

This completes the proof. □ 

Theorem 4. Let V = H). 

1. If H is a digital n-pseudomanifold, then H verifies the digital version of the 
Jordan- Brouwer separation theorem. 

2. If H is a digital n-weakmanifold, then H verifies the digital version of the 
Jordan- Brouwer separation theorem. 

Proof. It is easy to deduce this theorem from proposition 2. 

6 Conclusion 

We have defined in a binary (n + l)-digital image V = ^IZ^H)^ where 

LI = (2{n + 1), 3^^+^ — 1), a new class of objects H (digital n-weakmanifold and 
digital n-pseudomanifold ) that verifies the digital version of the Jordan- Brouwer 
separation theorem, for n > 2. 
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Abstract. Efim Khalimsky’s digital Jordan curve theorem states that 
the complement of a Jordan curve in the digital plane equipped with 
the Khalimsky topology has exactly two connectivity components. We 
present a new, short proof of this theorem using induction on the Eu- 
clidean length of the curve. We also prove that the theorem holds with 
another topology on the digital plane but then only for a restricted class 
of Jordan curves. 



1 Introduction 

The classical Jordan curve theorem says that the complement of a Jordan curve 
in the Euclidean plane consists of exactly two connectivity components. Efim 
Khalimsky’s digital Jordan curve theorem states the same thing for the digital 
plane T? . Of course we must use a suitable definition of the concept of digital 
Jordan curve, as well as a suitable topology on 7? . In this case 7? is given the 
Cartesian product topology of two copies of the digital line Z equipped with the 
Khalimsky topology. 

A proof of Khalimsky’s theorem was published in 1990 by Khalimsky, Kop- 
perman and Meyer [1990]. They refer to earlier proofs by Khalimsky (E. D. 
Halimskii) [1970, 1977]. Our first purpose in this note is to present a new, short 
proof. 

The idea of the proof is simple. For the smallest Jordan curves (having four or 
eight points) the conclusion of the theorem can be proved by inspection. Given 
any other Jordan curve J, we construct a Jordan curve J' which has shorter 
Euclidean length and is such that its complement has as many components as the 
complement of J. Since the possible Euclidean lengths form a discrete set, this 
procedure will lead to one of the smallest Jordan curves, for which the theorem 
is already established. The construction of J' can intuitively be described as 
follows: attack J where its curvature is maximal and shorten it there; it cannot 
offer resistance from within. 

We then consider a topology on which is not a product topology. In 
contrast to the Khalimsky topology it has the property that every point is either 
open or closed — there are no mixed points as in the Khalimsky plane. We prove 
that the Jordan curve theorem holds for this topology for a restricted class of 
Jordan curves. 
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2 Connectedness and Adjacency 

A topological space is said to be connected if it is nonempty and the only sets 
which are both open and closed are the empty set and the whole space. A subset 
of a topological space is called connected if it is connected as a topological space 
with the induced topology.^ A connectivity component of a topological space is 
a connected subset which is maximal with respect to inclusion. A component is 
always closed. A connected subset which is both open and closed is a component 
(but not necessarily conversely). 

If X and Y are topological spaces and f : X ^ Y a, continuous mapping, 
then the image f{A) of a connected subset A of X is connected. We may apply 
this result to a situation where we define a topology on a set Y using a mapping 
f : X ^ Y from a topological space X : 

Proposition 1. Let f : X ^ Y be a surjective mapping from a connected topo- 
logical space X onto a setY . Equip Y with the strongest topology such that f is 
continuous. Then Y is connected. 

In particular we shall use this result with X = R and Y" = Z to define connected 
topologies on the digital line Z. 

In any topological space we shall denote by N{x) the intersection of all neigh- 
borhoods of a point x. Spaces such that N{x) is always a neighborhood of x were 
introduced and studied by Aleksandrov [1937]. Equivalently, they are the spaces 
where the intersection of any family of open sets is open. In such a space all 
components are open. 

The closure of a subset A of a topological space will be denoted by A. We 
note that x G {y} if and only if ^ G N{x). 

A Kolmogorov space (Bourbaki [1961:I:§1: Exerc. 2]; also called a To-space) 
is a topological space such that x G N{y) and y G N{x) only if x = ^. It is 
quite reasonable to impose this axiom; if x belongs to the closure of {y} and 
vice versa, then x and y are indistinguishable from the point of view of topology. 
(We should therefore identify them and consider a quotient space.) 

The separation axiom T\ states that N{x) = {x}. It is too strong to be of 
interest for the spaces considered here. 

Two points X and ^ in a topological space Y are said to be adjacent \i x ^ y 
and {x^y} is connected. We note that {x, is connected if and only if either 
X G N{y) or ^ G N{x). We shall say that two points x, z are second adjacent if 
X ^ z; X and z are not adjacent; and there exists a third point y ^ Y such that x 
and y are adjacent and y and z are adjacent. 

3 Topologies on the Digital Line 

It is natural to think of Z as an approximation of the real line R and to consider 
mappings / : R ^ Z expressing this idea. We may define /(x) to be the integer 

^ According to Bourbaki [1961:I:§11:1] the empty space is connected. Here I follow the 
advice of Adrien Douady (personal communication, June 26, 2000). In the present 
paper it will not matter whether the empty set is said to be connected or not. 
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closest to x; this is well-defined unless x is a half-integer. So when x = n -h ^ we 
have a choice for each n: shall we define f{n ^) = n or f{n -h = n -h 1? If we 
choose the first alternative for every n, thus putting f~^{n) = ] n — n -h |] , the 
topology defined in Proposition 1 is called the right topology on Z; if we choose 
the second, we obtain the left topology on Z; cf. Bourbaki [1961:I:§1: Exerc. 2]. 
Khalimsky^s topology arises if we always choose an even integer as the best ap- 
proximant of a half-integer. Then the closed interval is mapped to 0, so 

{0} is closed for the Khalimsky topology, whereas the inverse image of 1 is the 
open interval ] ^, | [, so that {1} is open. 

A Khalimsky interval is an interval [a, 6] H Z equipped with the topology 
induced by the Khalimsky topology on Z. A Khalimsky eirele is a quotient space 
Zm = Z/mZ of the Khalimsky line for some even integer m ^ 4. (If m is odd, 
the quotient space receives the chaotic topology, which is not interesting.) 

4 Khalimsky Jordan Curves 

Khalimsky, Kopperman and Meyer [1990:3.1] used the following definitions of 
path and arc in the Khalimsky plane. We just extend them here to any topo- 
logical space. We modify slightly their definition of a Jordan curve [1990: 5.1]. A 
Jordan curve in the Euclidean plane is a homeomorphic image of the circle 
R/Z, and similarly a Khalimsky Jordan curve is a homeomorphic image of a 
Khalimsky circle. 

Definition 1. Let Y he any topologieal spaee. A Khalimsky path in Y is a 
eontinuous image of a Khalimsky interval. A Khalimsky arc is a homeomor- 
phie image of a Khalimsky interval. A Khalimsky Jordan curve in Y is a 
homeomorphie image of a Khalimsky eirele. 

Sometimes Khalimsky Jordan curves are too narrow. We impose a condition 
on them to make their interior fatter: 

Definition 2. Let J he a Khalimsky Jordan eurve in a topologieal spaee Y . We 
shall say that J is strict if every point in J is seeond adjaeent to exaetly two 
points in J. 

We note that if x, z G J are second adjacent, then the intermediary y required 
by the definition need not belong to J . Thus the concept of strict Jordan curve 
is not intrinsic. 

A three-set {x,y,z} such that all three points are adjacent to each other 
can be a Khalimsky path but never a Khalimsky arc. This follows from the 
fact that in a Khalimsky interval [a, 6], the endpoints a and h are not adjacent 
unless h = a -h 1. Let us say that a three-set {x, z} in a topological space 
is a forbidden triangle if all points are adjacent to each other. The absence of 
forbidden triangles is therefore a necessary condition for Khalimsky arcs and 
consequently for Khalimsky Jordan curves, and it is often easy to check. 

Different topologies may induce the same adjacency structure. However, when 
the adjacency structure is that of a Khalimsky circle, the topology of the space 
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must also be that of a Khalimsky circle. More precisely we have the following 
result. 

Theorem 1. Given a subset J of a topologieal spaee Y, the following eonditions 
are equivalent. 

(A) J is a Khalimsky Jordan eurve. 

(B) J has at least four points, and for every a G J, J \ {a} is homeomorphie to 
a Khalimsky interval. 

(C) J is finite, eonneeted, with eardinality at least 4 , and eaeh of its elements 
has exaetly two adjaeent points. 

(D) J has the adjaeeney strueture of a Khalimsky eirele, i.e., J = {xi,X2, 

for some even integer m ^ 4 and for eaeh j = 1, ...,m, Xj-i and Xj^i and no 
other points are adjaeent to Xj. (Here we eount indiees modulo m.) 

Proof. If (A) holds, then for every a G J, J \ {a} is homeomorphie to a Kha- 
limsky circle minus one point, thus to a Khalimsky interval. Conversely, suppose 
that (B) holds and consider J \ {a} and J \ {6} for two points a,b ^ J which 
are not adjacent. Then we have homeomorphisms of J \ {a} and J \ {b} into 
a Khalimsky circle Z^. We can modify them by rotating the circle so that the 
two mappings agree on J \ {a, b}. Then they define a local homeomorphism of J 
onto Z^, thus a homeomorphism; we have proved (A). 

It is clear that (A) implies (C) and (D). 

Suppose that (C) holds. Then call an arbitrary point x\ and one of its ad- 
jacent points X2 and then go on, always choosing xj+i after Xj so that is 
adjacent to Xj but not equal to any of the already chosen x\, ...,Xj-\. After a 
while we must arrive at a situation where there are no new points left, i.e., we 
arrive at Xm and the two points adjacent to Xm are Xm-i and a point which 
has already been chosen, say Xk- A priori k may be any of 1,2, ...,m — 2, but 
in fact the only possibility is k = 1 — any other choice would mean that Xk had 
three adjacent points contrary to the assumption. It remains to be seen that m 
is even. That Xj and Xjj^i are adjacent means that we have either Xj c N{xjj^\) 
or Xjj^i C N{xj). If Xj c N{xjj^i), then we cannot have Xjj^i C N{xjj^2)^ for 
that would imply that Xj belonged to N{xjj^2)^ so that Xjj^2 would have three 
adjacent elements, viz. Xj, Xj^i and xj+3. So the statement Xj G N{xjj^\) holds 
only for j of a certain parity. Since this is true modulo m, that number must be 
even. Thus we have proved (D). Conversely, (D) obviously implies (C) since (D) 
is just a more detailed version of (C). 

It remains to be seen that (D) implies (A). First of all it is clear that, assum- 
ing (D), N{x) can never have more than three elements — a fourth element would 
mean that x had at least three adjacent points. So N{xj) C {xj-\,Xj,Xj^i}. 
Considering the three points we note that either Xj-i G N(xj) 

or Xj G N{xj-i), and that Xj G N{xjj^\) or G N{xj). However, these 

alternatives cannot be chosen at will, for as we have seen in the previous para- 
graph Xj-i G N(xj) implies Xj ^ N(xjj^i). Consider now the case Xj-i G N{xj). 
Then G N(xj), so that N(xj) D {xj-i,Xj,Xjj^i}. On the other hand we 
know already that N{xj) has at most three elements; we conclude that N{xj) = 
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{xj-i^Xj^Xjj^i}. By the same argument, N{xjj^2) = ^j+2, ^i+s}- There- 

fore N{xjj^i) = {xj+i}, and we have proved that T is a Khalimsky circle 
where points with indices of the same parity as j have three-neighborhoods 
and points with indices of the other parity are open. The other possibility, 
viz. that Xj G N{xj-i)^ can be reduced to the former by just shifting the indices 
one step. 

It follows from property (C) that two Khalimsky Jordan curves can never 
be contained in each other. More precisely, if J and K are Khalimsky Jordan 
curves and J ^ then J = K. 

A point on a Khalimsky Jordan curve J consisting of at least six points 
has at least two second adjacent points; with the order introduced in property 
(D), Xj-2 and Xj^2 are second adjacent to Xj and Xj-2 7^ ^j+2 when m > 4 . 
Then Xj±\ serve as intermediaries, but there may also exist other intermediaries. 
When a Jordan curve is not strict and m > 4 , then some point, say Xj, has at 
least one second adjacent point in addition to Xj-2 and Xj+2, say Xk- Then an 
intermediary h such that Xj and b are adjacent and b and Xk are adjacent cannot 
belong to J. 

Suppose now that Y" is a metric space with metric d. Since every Khalimsky 
arc r is homeomorphic either to [ 0 , m — 1 ] H Z or to [ 1 , m] H Z for some m, it can 
be indexed as {xi, ..., where the indices are uniquely determined except for 
inversion. We may define its length as 

m— 1 

length(T) = 

1 

Similarly, a Khalimsky Jordan curve can be indexed as {xi,...,x^}, where 
the indices are uniquely determined up to inversion and circular permutations, 
and its length can be defined as 



m 

length(J) = Xj), 

1 



where we count the indices modulo m. 

We shall use the following norms in to measure distances in 7 ?\ 



||x||p = ||(xi,X2)||p = 



{\xi\p + \x2\py^^, 

max (|xi|, |x2|), 



X G R^, 1 ^ p < -hoo; 

X G R^, p = 00. 



5 Khalimsky’s Digital Jordan Curve Theorem 

The Khalimsky topology of the digital plane is the Cartesian product topology 
of two copies of the Khalimsky line Z. A point x = (xi,X2) in the product 
Z^ = Z X Z is closed if and only if both xi and X2 are closed, thus if and only if 
both xi and X2 are even; similarly x is open if and only if both coordinates are 
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odd. These points are called pure] the other points, which are neither open nor 
closed, are called mixed. 

Perhaps the quickest way to describe Khalimsky’s topology Tqo on 7? is this: 
We first declare the nine-set 

U^ = {x^ Z2; ||x||oo < 1} = {(0,0), ±(1,0), ±(1,1), ±(0,1), ±(-1,1)} (1) 

to be open, as well as all translates Uoo-\~c with ci, C 2 C 2Z. Then all intersections 
of such translates are open, as well as all unions of the sets so obtained. As a 
consequence, {( 1 ,- 1 ), ( 1 , 0 ), ( 1 , 1 )}, the intersection of Uoo and Uoo + ( 2 , 0 ), and 
{( 1 , 1 )}, the intersection of Uoo and Uoo + ( 2 , 2 ), are open sets, and {( 0 , 0 )} is a 
closed set. The sets {(1,0)} and {(0, 1)} are neither open nor closed. 

Theorem 2. Given a subset J of Z‘^ equipped with the Khalimsky topology, the 
eonditions A, B, C and D of Theorem 1 are all equivalent to the following. 

(E) J = for some even integer m ^ 4 and for all j, 

and and no other points are adjaeent to x^^^ ; moreover eaeh path eonsisting 

of three eonseeutive points x^^\ turns at x^^^ by 45° or 90° or 

not at all if x^^^ is a pure point, and goes straight ahead if x^^^ is mixed. 

Here we use the informal expression “turn by 45°” etc. with reference to angles 
in the Euclidean plane of which we consider the Khalimsky plane to be a subset. 

Proof. If (D) holds, we see that J cannot turn at a mixed point and cannot 
turn 135° at a pure point — otherwise we would have a forbidden triangle. So 
(D) implies (E). Conversely, (E) is just a more precise version of (D), so (E) 
implies (D). 

In this section we shall measure the lengths of Khalimsky Jordan curves using 
the Euclidean metric, d{x,y) = ||z — ^|| 2 . It is not possible to use || • ||i or || • ||oo 
in the proof of the Jordan curve theorem. 

The smallest possible Jordan curve in is the four-set 

J4 = (x G Z2; ||a; - (1, 0)||i = 1} = {(0, 0), (1, -1), (2, 0), (1, 1)}. (2) 

We add all translates of J 4 by a vector c c Z^ with c\ + C 2 even and call these 
the Jordan eurves of type J 4 . 

There is also a Jordan curve having eight points, 

Js = {xe Z± ||x||oo = 1} = ?7oo \ {(0,0)}. (3) 

This curve and all its translates by a vector c c Z^ with ci + C 2 even we call the 
Jordan eurves of type Jg • 

Let us agree to call the three-set 

T = {(1,1), (0,0), (1,-1)} (4) 

and rotations of T by 90°, 180° and 270°, as well as all translates of these sets 
by vectors c c Z^ with c\ + C 2 even, a removable triangle. It turns out that 
elimination of removable triangles is a convenient way to reduce Jordan curves, 
as shown by the following lemma. 
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Lemma 1. Let J he a Jordan eurve in the Khalimsky plane and assume that J 
eontains the three-set T defined by (f). Define 

J' = (Jx{(0,0)})U{(l,0)}. 

Then either J = or else J' is a Jordan eurve sueh that C J' and C J have the 
same number of eomponents, and length( J') = length( J) — 2^/2 + 2. 

Proof. Assume first that (2,0) G J, thus that J D J4. Then necessarily J = J4. 

Next we suppose that (2,0) ^ J. Then J' is a Jordan curve: J' is a set where 
the new point (1,0) plays exactly the same role topologically as the old point 
(0, 0) in J . Thus J' is also homeomorphic to a Khalimsky circle. 

Finally we must check that the number of components in C J' is the same as 
that of CJ'. Indeed, (1,0) and (2,0) belong to the same component of CJ, and 
(0, 0) and ( — 1,0) belong to the same component of C J'. 

Theorem 3 (Khalimsky ’s Jordan curve theorem). Let us equip the digital 
plane 7? with the Khalimsky topology Tqo (see (1)). Then for any Khalimsky 
Jordan eurve J inT? , the eomplement C J = \ J has exaetly two eonneetivity 

eomponents. 

Proof. The complement of J4 consists of A = {(1,0)} and the set B of all 
points X with \x\ — 1\ P \x 2 \ > 1. It is obvious that these two sets are connected. 
Moreover, A is closed and open in CJ4, so it is a component. Therefore, also B 
is closed and open in C J4 and also a component. The proof for Jg is similar. 

Thus we know that the conclusion of the theorem holds for Jordan curves of 
types J4 and Jg. 

Next we shall prove that if J is not of the kind already treated, then there 
exists a Jordan curve J' of strictly smaller Euclidean length such that CJ and 
C J' have the same number of components. After a finite number of steps we must 
arrive at a situation where the hypothesis is no longer satisfied, which means 
that we have a Jordan curve of type J4 or Jg, for which the complement has two 
components as we already proved. 

The construction of J' is as follows. First we may assume, in view of Lemma 1, 
that J contains no removable triangles. Define 

= m.i{x2]x G J). 

Thus X 2 ^ U 2 for all points z G J with equality for at least one x. Consider a 
horizontal interval 



iJ = {(xi,a2)} + {(0,0),(l,0),...,(p,0)} 

which is maximal with respect to inclusion and consists of points in J with 
ordinate equal to a2- The maximality implies that the two points {x\ — l,a2) 
and {x\ + p + 1, tt2) do not belong to J . Then we see that p must be an even 
number, but we cannot have p = 0, since that would imply that J contained a 
removable triangle, contrary to the assumption. Thus contains at least three 
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points. Moreover, at the endpoints of JT, J must turn upwards. Indeed, since 
(xi — l,a 2 ) does not belong to J, exactly one of the points {xi — l,a 2 + 1), 
(xi,a 2 + 1) belongs to J; when we go left from (xi,a 2 ), the curve must turn 
upwards by either 45° or 90°; it cannot turn downwards. Similarly, the curve 
turns upwards by 45° or 90° when we go right from the last point in JT, viz. from 
{xi a2). 

We now consider the set X of all maximal horizontal intervals / in J such 
that J turns upwards at the endpoints of I. The previous argument served just 
to prove that there exists such an interval. Now there exists an interval K G X 
of smallest length. 



i^ = M + {(o,o),(i,o),...,feo)}, 

containing q-\-l points for some even number q ^ 2. We shall assume that K is of 
smallest length also among all intervals that can be obtained from the intervals 
in X by rotating them 90°, 180° or 270°. 

To simplify the notation we may assume (after a translation if necessary) 
that y = (0, 0), so that 

K = {(0, 0), (1, 0), (g, 0)} = [(0, 0), (g, 0)] n 

Case 1. J turns upwards by 45° at both ends of K. This means that (—1, 1) 
and + 1, 1) both belong to J. In this case, we define 

J' = (JxiC)U(iC + (0,l)). 

This operation shortens the Euclidean length by 2\/2 — 2 (but it does not shorten 
the length). We note that the interval iC + (0, 1) is disjoint from J; otherwise 
some point in K would have three adjacent points. Moreover iC + (0, 2) must be 
disjoint from J. Indeed, if {K + (0,2)) n J were nonempty, then either J would 
contain a removable triangle (contrary to our assumption) or there would exist 
a subinterval iC' of iC + (0, 2) contained in J and such that J turns upwards at 
its endpoints; thus K' C X. This subinterval must have fewer than q^l points, 
since (0, 2) and {q, 2) cannot belong to J — otherwise there would be a removable 
triangle in J. Now a shorter interval is impossible, since K is by assumption an 
interval in X of shortest length. One checks that J' is a Jordan curve. Indeed, 
the points of K + (0, 1) play the same role topologically in J' as do the points 
of iC in J. The number of components in the complement of J' is the same as 
for J. 

Case 2. J turns upwards by 90° at one end of K. Assume that (0, 1) G J, the 
case (g, 1) G J being symmetric. Then also (0, 2) G J . We consider the subcases 
2.1 and 2.2. 

Case 2.1. (2,2) ^ J. We cut off a corner, i.e., we remove (0,1), (0,0), (1,0), 
and add (1,1). This operation shortens the Euclidean length by 4 — 2v^ (but J' 
has the same /^-length as J). Since (1, 1) and (2, 2) belong to the same component 
of C J, and (0, 1), (0, 0), (1,0), and ( — 1,0) belong to the same component of C J', 
the number of components in the respective complements are the same. 
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Case 2.2. (2,2) G J. We consider four subcases, 2. 2. 1.1, 2. 2. 1.2, 2.2.2. 1 and 

2 . 2 . 2 . 2 . 

Case 2. 2. 1.1. (2,1) G J, (1,2) G J. Then J contains a Jordan curve of 
type Jg, more precisely J D (1, 1) + Jg. So J must be equal to that curve. 

Case 2. 2. 1.2. (2, 1) G J, (1, 2) ^ J. Remove the five points (0, 1), (0, 0), (1, 0), 
(2,0), (2, 1), and add (1,2). Thus J' is shorter by 4. We can check that J' has 
all desired properties. 

Case 2. 2. 2.1. (2, 1) ^ J, (1,2) G J. Turn 90° to reduce to case 2. 2. 1.2. 

Case 2. 2. 2. 2. (2,1) ^ J, (1,2) ^ J. This case cannot occur since q is smallest 
possible. To see this, define I' as the set of all points (2, 2), (3, 2), ..., {q' ^ 2) G J 
with q' as large as possible. If J turns upwards at (^^2), then I' belongs to X 
with q' < q^ which contradicts the definition of K and q. If on the other hand J 
turns downwards at {q' , 2), then there exists a vertical interval consisting of three 
points, which becomes an interval in X if we turn it 90°, thus again contradicting 
the definition of X. 

6 The Jordan Curve Theorem for Another Topology 

We define a topology ti on by first declaring the five-set 

?7i = {xgZ"; ||a;||i<l} = {(0,0),±(l,0),±(0,l)} (5) 

to be open, then all translates U c with c G Z^, ci + C2 G 2Z to be open, as 
well as all intersections of such translates (cf. (1)). This implies that {(1,0)} is 
open, and that the origin is closed. In fact, all points with x\ -\- X 2 G 2Z 

are closed, and all points with xi +X 2 ^ 2Z are open; there are no mixed points. 
This topology was described by Wyse et al. [1970] and Rosenfeld [1979:624]. 
The four-set 

J'= {(0,0), (1,0), (1,1), (0,1)} (6) 

is a Jordan curve for t\. However, it is not strict, for a point in has only 
one second adjacent point. Its complement is connected, so the Jordan curve 
theorem does not hold. The set Jg defined by (3) is a Khalimsky Jordan curve 
and its complement has exactly two components. Also Jg is not strict, because 
the point (1,0) has three second adjacent points, viz. (0, 1), (0, —1) and ( — 1,0). 
Another example is the twelve-set 

Jl2 

= {(0, 0), (1, 0), (2, 0), (2, 1), (3, 1), (3, 2), (3, 3), (2, 3), (1, 3), (1, 2), (0, 2), (0, 1)}. 

It is a Jordan curve, not strict, and its complement has three connectivity compo- 
nents, viz. an infinite component and the two singleton sets {(1,1)} and {(2, 2)}. 

Theorem 4. Let T? he equipped with the topology t\ just defined (see (5)). Then 
the eomplement of every striet Jordan eurve has exaetly two eomponents. 
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Proof. For the proof we shall use the fact that equipped with the topology 
Ti is homeomorphic to the subspace of all pure points in the Khalimsky plane. 
This fact was used also by Kong, Kopperman and Meyer [1991:915]. 

Let X be the digital plane with the topology ti, and Y the Khalimsky 
plane (Z^ with the topology Too). Consider the mapping (p: X ^ Y defined by 
(p(^x) = (xi — X 2 , Ti +X 2 ). Its image (f{X) is the set of all pure points in K, and if 
we equip it with the topology induced by Y it is homeomorphic to X. Moreover, 
the image of any Khalimsky Jordan curve J in X is a Khalimsky Jordan curve 
in Y. Therefore Y \ (p(J) has exactly two components by Theorem 3. We claim 
that (p{X) \ has exactly two components. It is clear that this set has at 
least two components, so the problem is to prove that a component AofY J) 
gives rise to a connected set A n (p{X), i.e., that the pure points in A form a 
connected set. 

To this end, assume that a, a' c An ip (X), and consider a Khalimsky arc 
{a = ..., = a'} 

contained mY\ip{J). (Connectedness in Y is the same as arcwise connectedness; 
cf. Khalimsky et al. [1990: Theorem 3.2].) We shall prove that this arc can be 
replaced by another consisting only of pure points. So assume that aC) is a 
mixed point. Then its predecessor and its successor aC+^) are both pure 

points. Without loss of generality, we may assume that = (0,0), aC) = 

(0, 1), and aC+i) = (0,2). We may then replace a^) by one of the pure points 
(—1,1), (1,1), because both of them cannot belong to ip{J). To see this, suppose 
that (1,1), (—1,1) G ip{J). Then ( — 1,1) would be a second adjacent point to 
(1,1), and this point has, by hypothesis, exactly two second adjacent points in 
ip{J) (considering everything in the space ip{X)). However, none of them can 
be equal to ( — 1,1), for the only possible intermediaries would then be (0,0) 
and (0, 2), none of which belongs to (p{J). (In a strict Jordan curve, one of the 
possible intermediaries to a second adjacent point must belong to the curve.) 
This contradiction shows that not both of (1,1) and ( — 1,1) can belong to ip{J). 
Thus we may define b = ( — 1,1) or b = (1,1) so that b ^ p^{J) and observe that 

is a Khalimsky arc with a mixed point replaced by a pure point. After finitely 
many such replacements we obtain an arc connecting a and a' and consisting only 
of pure points. This shows that ip{X) \ p{J) has at most as many components 
as Y \ ^(J); therefore exactly two components, and then the same is true of 
X\J. 

7 Conclusion 

We have showed that digital Jordan curves can be subject to a simple induction 
process: their lengths in a suitable metric form a discrete set, and if we show 
that a curve has a certain property when a shorter curve has the same property. 
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the induction works, starting from some very short curves. Using this idea, we 
have presented a short proof of Khalimsky’s digital Jordan curve theorem, which 
is valid for the Khalimsky topology on the digital plane, where each point has 
eight adjacent points. We have also considered a topology for the digital plane 
where each points has four adjacent points. For this topology the Jordan curve 
theorem does not hold in general, but it does for a restricted class of curves. 
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Abstract. The paper presents a new method of investigating topologieal 
properties of three-dimensional manifolds by means of eomputers. 
Manifolds are represented as finite eell eomplexes. The paper eontains 
definitions and a theorem neeessary to transfer some basie knowledge 
of the elassieal topology to finite topologieal spaees. The method is 
based on subdividing the given set into bloeks of simple eells in sueh a 
way, that a ^-dimensional bloek be homeomorphie to a ^-dimensional 
ball. The bloek strueture is deseribed by the data strueture known as 
”eell list” whieh is generalized here for the three-dimensional ease. 
Some experimental results are presented. 



1 Introduction 

Topologieal knowledge plays an important role in eomputer graphies and image 
analysis. Images may be represented in computers only as finite sets. Therefore it is 
usual to perform topological investigations in a Hausdorff space and then to transfer 
the results to finite sets. One of the aims of the present investigation is to demonstrate 
that topological investigations may be performed directly in finite sets on which a 
To-topology is defined. Such a topological space can be represented in computers. We 
demonstrate here a new tool for investigating 3 -manifolds by means of computers: the 
three-dimensional cell list. The same tool may be implemented for economically 
encoding and analyzing three-dimensional images, e.g. in computer tomography. 



2 State of the Art 

It is known from the topological literature that the problem of the complete 
classification of 3 -manifolds is still unsolved while the classification of 2-manifolds is 
known since about hundred years [3]. In recent time some efforts have been made to 
use computers for investigating 3 -manifolds. 

In [9] the following method based on the notion of a spine was suggested. A spine 
[2] is some kind of a two-dimensional skeleton of the 3 -manifold: if ^ is a 
polyhedron, if K collapses to Z [12, p. 123], and if there is no elementary collapse 
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of L, then Z is a spine of K. A standard [2] or special [9] spine of a manifold has the 
same fundamental group as the manifold. In [9] the notion of complexity ^ of a 
3 -manifold was introduced. It is the number of vertices (0-cells) in the so called 
almost special spine of the manifold under consideration. It has been shown that the 
singular graph of a special spine is a regular graph of degree 4. There are only finitely 
many different spines corresponding to a given regular graph of degree 4. Thus it is 
possible to enumerate all spines with a given number of vertices. 

Matveev has also introduced the so called T-transformation, which transforms a 
spine of a given manifold to another spine of the same manifold, which may be 
simpler. He also uses topological invariants introduced in [14]. More than 1000 
3 -manifolds of complexity k up to nine have been analyzed by means of this 
method [9]. 

We suggest here another method of using computers for the investigation of 
3 -manifolds. According to our method a 3 -manifold is represented as an abstract cell 
complex (ACC) [6] with a minimum number of cells. It is encoded by the cell list as 
described in Section 5. It is easy to see that homeomorphic cell lists correspond to 
combinatorially homeomorphic manifolds. The question whether the minimum cell 
list of a 3 -manifold is unique is recently open. There is the hope that in the case that it 
is not unique, the number of different cell lists of a 3 -manifold (of a limited 
complexity) with a minimum number of cells is not too large, so that all such lists 
may be exhaustively tested by a computer whether they are combinatorially 
homeomorphic to a cell list of some already known manifold. In this presentation we 
describe our method of computing the cell list with a minimum number of cells for a 
given 3 -manifold. 



3 Basic Notions 

It is known [10] that any 3 -manifold may be triangulated and that homeomorphic 
3 -manifolds are combinatorially homeomorphic. Two complexes are called 
combinatorially homeomorphic if their simplicial schemata become isomorphic after 
finite sequences of elementary subdivisions [12, p. 24]. However, simplicial 
complexes contain too many elements and are therefore difficult to process. Simplices 
may be united to greater cells by an operation inverse to the subdivision: a 
subcomplex combinatorially homeomorphic to a A:-simplex (or equivalently to a 
^-ball) may be declared to be a ^-dimensional cell or a ^-cell. In what follows we shall 
write "homeomorphic” for "combinatorially homeomorphic”. 

While simplices are mostly considered as subsets of a Euclidean space we prefer to 
work with ACC's [6]. An ACC is a set of abstract cells. A non-negative integer is 
assigned to each cell. It is called the dimension of the cell. The set is provided with an 
antisymmetric, irreflexive and transitive binary relation called bounding relation. If 
the cell Cl bounds the cell C 2 it is usual to write Ci<C 2 . A cell can only bound another 
cell of higher dimension. 

ACC's differ both from simplicial and Euclidean complexes in so far that a cell is 
never a part of another cell. This property makes it possible to easily introduce the 
notion of open subsets of an ACC and thus to define a To-topology on it in accordance 
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with classical axioms [6]. Although an ACC is a quotient of some Hausdorff space we 
do not consider the eells as subsets of a Hausdorff spaee, whieh subsets are infinite 
ones and therefore not representable in eomputers. We rather eonsider eells as 
elements of an abstract finite set. This is another advantage of the ACC's sinee the 
topological space of a finite ACC may be directly and completely represented in a 
computer. Thus there is no neeessity to eonsider theoretieal problems in a Hausdorff 
spaee (whieh is not representable in eomputers) and then to transfer the results to a 
different set represented in the eomputer. This advantage of the ACC's is widely used 
in the present investigation. 

One of our methods of representing 3 -manifolds in eomputers eonsists in 
eonstrueting a four-dimensional ACC in the eomputer, in defining a strongly 
eonneeted subset of the ACC and in ealeulating the boundary of the subset. 

To make the number of eells as little as possible we subdivide the ACC A 
representing a 3 -manifold into subsets eaeh of whieh is homeomorphic to an open 
A:-ball. We call such a subset a k-dimensional block cell or a k-block of A block b\ 
of A is said to bound another bloek Z ?2 of ^ if eontains a eell of A whieh bounds 
another eell of A eontained in Z? 2 . In this way a bounding relation is defined on the set 
of blocks of A and the set becomes an ACC B called the block complex of A. The 
bloeks are cells of B. The topology of the block complex 5 is a quotient topology of 
that of the underlying ACC A, however, there is no necessity to consider it as a 
quotient topology of an Euclidean space. 



4 Incidence Structures 

4.1 The Main Idea 

In topological literature manifolds are often represented as cell complexes. Thus e.g. 
the surface of a torus may be represented as a complex consisting of a 0-cell, two 
1 -cells and one 2-cell (Fig. la). This representation has the advantage of being very 
simple. 





b 



Fig. 1. Representations of the surface of a torus (a) and of a simple complex (b) 
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However, if one would try to interpret this representation as an ACC, diffieulties 
would oeeur sinee e.g. the ACC's eorresponding to Fig. la and Fig. lb are the same: 
the same sets of four eells, the same bounding relation and the same dimensions of the 
eells. The differenee between these two eomplexes is that eaeh of the 1 -eells in Fig. 
la bounds the 2-eell two times, on both sides. This may be seen, if one eonsiders the 
embedding of the eomplex in an Euelidean spaee: a neighborhood of a point on the 
1-eell eontains two half-disks eaeh of whieh lies in one and the same 2-eell. However, 
there is no possibility to deseribe this relation in the language of the ACC's. 

Sinee one of our aims is to eonsider a purely eombinatorial approaeh with no 
relation to a Euclidean space we consider the possibility to overcome this difficulty by 
introducing the notion of an incidence structure. 

Definition PB: A A:-block is called proper if its closure is homeomorphic to a 
closed ^-ball. A block complex is called proper if all its blocks are proper. 

Thus when considering Fig. la as a representation of a block complex then it is not 
a proper one: though each A:-block is homeomorphic to an open A:-ball the closures of 
the blocks are not homeomorphic to closed A:-balls. 

An example of a proper block complex for the surface of a torus {n=2) is shown in 
Fig. 2. The only drawback of this representation is that it has too many blocks as 
compared to Fig. la. 




Fig. 2. A proper block complex of the surface of a torus 

It is possible to reduce the number of blocks of a proper block complex while 
uniting two adjacent A:-blocks which are not mutually simple (see Section 6 for the 
definition of "simple") by an operation inverse to the elementary subdivision. It may 
happen, that each of the united blocks was incident to one and the same third block. 
Then the union of this two blocks would be incident to the third block twice, at two 
different locations. In such a case we can loose some information about the 
topological structure of the set of blocks incident with the united block since the 
description of a block complex as an ACC cannot indicate that a block is multiply 
incident with one and the same other block. 

To overcome this drawback we introduce for each block the so called incidence 
structure. 

Definition IN: Two cells of an ACC are called incident with each other if either 
they are identical, or one of them bounds another one. 

Definition IS: The incidence structure of a block BC of a proper block complex K 
is a subcomplex of K containing all blocks incident with BC except BC itself 

This subcomplex may be described as an ACC: by the set of its blocks, each block 
represented by its label, and the bounding relation. 
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To preserve the topologieal information about the set of bloeks ineident with a 
given bloek BC the ineidenee stmeture of BC must be stored before the uniting of 
bloeks. During the uniting of two bloeks the label of one of them in the ineidenee 
stmeture must be replaeed by the label of another one. In this way it beeomes possible 
that the label of one and the same block multiply occurs in the incidence structure of 
another block. 

The incidence structures of all blocks of a block complex must be stored in a data 
stmeture which is a generalization of the cell list [6]. The former cell list was 
designed to describe two-dimensional Cartesian ACC's [7, 8] where a point may be 
incident with at most four lines. In the generalized three-dimensional cell list the 
number of blocks incident with a point or with a curve in 3D is not limited. This 
property is important for transformations of block complexes during topological 
investigations. 



4.2 Incidence Structures in Multidimensional Spaces 

We will show in what follows that the incidence structure of any block of a 
multidimensional block complex representing a closed manifold is similar to the 
union of two topological spheres. This fact is the basis of the development of data 
stmetures enabling an economical representation of multidimensional block 
complexes in computers. To prove the necessary theorem we remain the reader some 
definitions. 

An ACC A is an Alexandroff space [1,5] and hence there exists in A the smallest 
open neighborhood of each cell c^A. It is the set containing c and all cells of A, 
bounded by c. We denote it by SON(c, A). For the incidence stmeture we need the 
subcomplex SON(c, A) without the cell c itself: we denote it by SON*(c, A)= 
=SON(c,v4)-{c}. 

The closure Cl(c, A) is a notion dual to SON(c, A). It is the set containing c and all 
cells of A, bounding c. Again, we need the set without c itself: Cl*(c, A)= 
=Cl(c, A)-{c}. The incidence structure of a proper block BC is the union: 

IS(BC, A)=SON*(BC, A)uCl^(BC, A), (1) 

The incidence structure of a non-proper block containing fewer cells must be 
computed while starting with that of the original proper block complex and uniting 
some blocks which are not mutually simple. 

Definition BI: An isomorphism between two complexes, which retains the 
bounding relation, is called B -isomorphism. BI: A-^B is a B-isomorphism iff for any 
ai, 02 ^ A, a\<a 2 implies BI{ax)<BI{a 2 ). 

Theorem SN: The set SON*(c^, M”) of any A:-cell of an /7-manifold M" is 
B-isomorphic to an (/7-A:-l)-dimensional sphere if does not belong to the boundary 
dM"" and if 0<A</7-l. The set Cl*(c^, M”) is then B-isomorphic to an {k-\)- 
dimensional sphere. 

To prove the Theorem we prove at first the particular case of A:=0, which is the 
contents of the following 

Lemma: The set SON*(c^, M”) of a 0-cell is B-isomorphic to an {n-\)- 

dimensional sphere. 
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Proof of Lemma: According to the definition of an ^-manifold M"" the SON of a 
point (i.e. of a 0-cell) is an open ^-ball B'". The frontier of B'" is an (n-l)- 

sphere Consider the set ^=SON*(c^, M'"). Each cell c^g S has some cells in 
which bound c^. Let us join them to a (A:-l)-dimensional block a block 

corresponding to is a subset of homeomorphic to an open (^-l)-dimensional 
ball, containing all (^-l)-dimensional cells which bound and containing 

also the closures of all cells of dimension k-2 each of which bounds at least two of 
the With other words: 



b^^-\c^)=U{c)-dU{c\ where f/(c^)=Cl*(c^)nS^”“^\ (2) 



All such blocks compose an (^-l)-dimensional block complex of 

homeomorphic to 8^""“^^ and thus being a topological (w-l)-sphere. The map 
/: takes each A:-dimensional cell to a (A:-l)-dimensional block of 

corresponding to Under rather general suppositions about M" the map / 
retains the bounding relation: for any two cells a, heS, a<b implies I{a)<I{b). Thus / 
is a B -isomorphism. 

Proof of the Theorem: Consider the 80N of a 0-cell and a A:-cell 
c^G 80N*(c^, Vf"), According to Lemma c will be mapped (as an element 

of 80N*(c^, M"")) by / onto a (A:-l)-dimensional cell of an (^-l)-dimensional 
sphere 8uppose, the Theorem is true for a (A:-l)-dimensional cell of a manifold. 
8ince 8^'"“^^ is a manifold, must be B-isomorphic to a sphere of the 

dimension: 

(n-l)-(k-l)-l=n-l-k+l-l=n-k-l. (3) 

However, / maps 80N*(c^, M") onto letter onto 

Thus, if the Theorem is true for a (A:-l)-dimensional cell it is also true for a 
A:-dimensional one. According to Lemma the Theorem is true for A:=l. Therefore, it is 
true for any 1<A^(^-1). 

To prove the assertion concerning Cl* it is sufficient to consider a set dual to M", 
where each ^-cell is replaced by an (^-A:)-cell, the bounding relation is reversed and 
the 80N of a cell c is replaced by its closure. 

Fig. 3 shows the 80N* of a point in a 3D Cartesian ACC and its B-isomorphic 
map onto the surface of an octahedron, which surface is a 8^. The 80N* of a point 
contains 8 cubes V\ to Vg (V 2 is removed), 12 faces and 6 edges. The B-isomorphic 
surface of an octahedron contains 8 faces, 12 edges and 6 points. 




Fig. 3. The SON of a point (a) and the B-isomorphie surfaee of an oetahedron (b) 
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The above results are illustrated in Table 1 showing the ineidenee struetures of 
interior eells (or bloeks) of a three-dimensional Cartesian ACC A^. In eases of spaees 
of dimension 2 and 3 the union of SON* with Cl* happens to be B-isomorphie to a 
two-dimensional sphere for eells of any dimension. It should be noted that this faet is 
of no importanee for applieations sinee the implementation of a data strueture 
isomorphie to the union is simpler then that of S^. Table 1 shows the ineidenee 

struetures of eells of all dimensions k and the 2-spheres B-isomorphie to them. 

We use the ineidenee struetures to deseribe non-proper bloek eomplexes. Sueh a 
deseription is the list of ineidenee struetures of all bloeks of a eomplex, ealled the cell 
list [6, 7]. The eell list for 3D eomplexes is deseribed below in Seetion 5. Aeeording 
to Theorem SN the ineidenee struetures of a A:-bloek in an ^-dimensional manifold M"" 
eonsists of two eomplexes one of whieh is B-isomorphie to and the other to 

S(^-i) Thus the topologieal strueture of M'" may be deseribed as a list of deseriptions 
isomorphie to spheres of lower dimensions. Therefore it may be reeursively 
eomposed of struetures isomorphie to and whieh are a pair of points and a 
eyelieally elosed sequenee respeetively. 



Table 1. Ineidenee struetures in a 3D spaee 




5 The Three-Dimensional Cell List 

On the base of Theorem SN it becomes possible to construct the three-dimensional 
cell list as a set of tables while each line of a table describes the incidence structure of 
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a block of the block complex of a 3 -manifold. Each incidence structure is described as 
one or two ACC’s each of which is B-isomorphic to a A:-sphere with k<2. 

Let us demonstrate an example. The tables below are constituents of the 
3-dimesnional cell list of the 3 -manifold with boundary shown in Fig. 4. 







Fig. 4. Example of a simple 3-manifold with boundary 



Table 2. List of the branch points (0-blocks) 



Label 




Line 


Ason 


Pointer 


Chained list 


Pi 


2 


-Li 


5 


Zi^ 


-Fi ^ Vi ->-F2 -^V2^+F3->0 


+L 2 


5 


Z2^ 


F, ^+Fi^0 


Pi 


2 


+Li 


5 


Zj^ 


-Fi Fi ->-F2 ■^V2->+F3->0 


-Li 


5 


Z4^ 


-Fj^ Fj^+Fj^ F, ^+Fi^0 



The value Vnn denotes the number of lines (1 -blocks) incident with Pj,J=l,2, while 
the value Ason denotes the number of blocks in the SON* of the corresponding line 
Li, z=l,2. These blocks compose the chained list where they are represented in the 
order of the rotation around Lf corresponding to the right screw. The sign before 
the label of an oriented face Fj, shows that the normal to Fj, points against the rotation. 
The pointer points to the first element of the chained list. A zero symbol at the end 
of the chained list denotes that the list is not cyclically closed which may be the case 
for manifolds with boundary. 

The list of the lines (1 -blocks) contains two special blocks: the starting and the end 
point of the line Li and besides that the cyclic sequence of faces and volumes incident 
with Li as described above. The value Ason denotes the number of blocks in the SON* 
ofA. 



Table 3. List of the lines (1-bloeks) 



Label 


Start 


End 


Ason 


Pointer 


Chained list 


Li 


Pi 


Pi 


5 




-Fi Vi Fz^+F3 ^0 


Li 


Pi 


Pi 


5 




Fz^+Fz^ Fi ^+Fi ^0 
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The list of faces (2-blocks) has a similar structure: the incidence structure of a face 
Fi contains two special blocks (these are now the two incident 3 -blocks) and a cyclic 
sequence of Na blocks in C1*(F/). The first symbol is repeated at the end of the 
chained list to show that the sequence is closed. 



Table 4. List of faces (2 -blocks) 



Label 


+Vol 


-Vol 




Pointer 


Chained list 




- 


F, 


4 




P,^-L2^P2^-L,^P: 


F2 




F2 


4 




P,^-L2^P2^-L,^P: 


F, 


- 


F2 


4 


Z,^ 


Fi-^-hLi-^F2-^+L2-^P 1 



Table 5. List of volumes (3 -blocks) 



Label 


Ny 


Face 


A^ci 


Pointer 


Chained list 


F, 


2 


+F, 


4 




Pl^-L2-^P2-^-Li^Pl 


-F2 


4 


Zi,^ 


Pl^+Li^P2^+L2^Pl 


F 2 


2 


+F2 


4 


Zl2~^ 


P,^-L2^P2^-L,^Pi 


+F 3 


4 


Zi3^ 


Pl^+Li^P2^+L2^Pl 



The value denotes the number of faces Fj incident with z-1,2, while the value 
Nci denotes as before the number of blocks in Cl*(Fy). 

A presentation of the fundamental group of a given complex may be computed 
from its cell list by the method suggested by Poincare [11] and proved by Tietze [13]. 
According to this method it is necessary to find the spanning tree of the 1-dimesnional 
skeleton of the complex and ignore all 1 -cells in the tree. Each of the remaining 

1 - cells is a generator, the concatenation of the generators in the perimeter of each 

2- cell, being equated to identity, is a relation of the presentation of the fundamental 
group. 

In a similar way cell lists for manifolds (with and without boundary) of greater 
dimension may be constructed. The list consists of incidence structures each of which 
is B-isomorphic to a sphere of some lower dimension and thus may be described by a 
cell list of lower dimension. Thus e.g. in the cell list of a 5-manifold the incidence 
structure of a 0- and of a 5 -block is a cell list of dimension 4. The incidence structures 
of other blocks are lists of lower dimensions. 

The gained understanding shows that cell lists for block complexes of manifolds of 
any dimension may be constructed by means of a recursion: the cell list of an 
z 2 -manifold consists of lists of lower dimensions. 
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6 Computer Experiments 

6.1 Generating Block Complexes and Cell Lists of 3-Manifolds 

We use two methods of produeing bloek eomplexes and their eell lists in the 
eomputer. The first method implements the elassieal idea of gluing (or identifying) 
the faees of a polyhedron. The deseription of a polyhedron may be input into the 
eomputer manually, just in the form of a three-dimensional eell list eontaining a 
single 3-bloek and as many 2-bloeks as the number of faees. Also a list of desired 
identifieations of the faees and their elosures (speeifying the homeomorphism of the 
gluing) must be input. The eorresponding eomputer program replaees the labels of 
some bloeks and ealeulates the new ineidenee struetures whieh are unions of the 
initial ones. The result is a eell list of a 3 -manifold. 

The seeond method eonsists in defining a strongly eonneeted subset of a four- 
dimensional Cartesian ACC represented as a four-dimensional array in the eomputer. 
The boundary of the subset is then the desired 3 -manifold. It is a three-dimensional 
ACC. 

The bloek eomplex of a given ACC may be eomputed as follows. Consider two 
elosed ^-balls whose boundary interseetion is an (^-l)-ball. Then the union of the 
/ 2 -balls is again an / 2 -ball sinee uniting is a proeedure inverse to the elementary 
subdivision of an / 2 -eell. We eall sueh two / 2 -balls mutually simple or simple relative 
to each other. The union of the elosures of two mutually simple / 2 -eells or / 2 -bloeks is 
a closed subcomplex homeomorphic to a closed / 2 -ball B"". 

The program selects an arbitrary / 2 -cell of the given / 2 -dimensional complex as the 
seed of B". Then all / 2 -cells, which are simple relative to the growing ball B", are 
sequentially united with it, one cell at each step. The closures of the united / 2 -cells are 
labeled as belonging to the closure of the / 2 -block. When there are no more simple 
cells, the rest consisting of / 2 -cells which are not simple relative to B" can be 
subdivided into handles of indices 0 to 2 [4, p. 28]. All / 2 -cells of a handle and the 
cells of their closures which are not jet labeled get a label of the handle. This is 
accomplished in the order of decreasing indices: all handles of index 2 first, etc. 

Each handle of index k is then contracted to a ^-block, i.e. a corresponding ^-block 
is recorded in the cell list. The incidence structure of a block may be directly read 
from the closure of the / 2 -cells c" of the corresponding handle H, which cells are 
incident with the base of H\ the cells of Cl*(c") contain the labels of other handles 
having a common boundary with H, due to the labeling procedure described above. 

We have developed a computer program which automatically calculates according 
to the described algorithm the cell list of a two- or tree-dimensional orientable 
manifold without boundary. The manifold must be defined as the frontier of a 
strongly connected subsets of a four-dimensional Cartesian ACC. The program also 
minimizes the number of blocks while uniting pairs of 0-blocks (points) incident with 
a line (1 -block) until a block complex with a single 0-block, a single 3-block, m 
1 -blocks and m 2-blocks is obtained (the Euler number N^-N^^+N^-N^ =\-m+m-\ 
must be 0). 

Several examples of manifolds were successfully tested. As an example we show 
the results of investigating a well-known 3 -manifold S^xS^xS^ which may be obtained 
by identifying opposite faces of a cube. The minimized cell list of this manifold 




A New Means for Investigating 3 -Manifolds 67 



contains a single point, three closed curves, three faees eaeh spanned by two curves 
and a single volume. Since the eell list is redundant (its redundaney is neeessary for 
the purpose of a fast seareh) its eontents may be represented by that of the ineidence 
strueture of the single volume. As demonstrated above, the incidence strueture of any 
bloek of a 3 -manifold is B-isomorphic to a 2-sphere and henee may be projected onto 
the plane. Fig. 5 shows a planar projeetion of the ineidenee strueture of the 3 -bloek. 

In Fig. 5 elements of the same hatehing represent identified bloeks. The 2-bloeks 
are denoted by a, b and c. Primed symbols eorrespond to opposite orientations. The 0- 
and 1 -dimensional blocks are denoted by combinations of the symbols of the bounded 
2-blocks. 




Fig. 5. Incidence structure of the 3 -block of the block complex of the glued cube 

A presentation of the fundamental group of this manifold may be found as follows: 
there is a single vertex which is the spanning tree. Therefore all three 1 -blocks ab, ac 
and be are generators. Let us denote them by x=ab, y=ac and z=bc. Then the 
perimeter of the face a (the exterior area) contains the sequence xyx'y-l, the 
perimeter of the face b the sequence xzx'z-l and the perimeter of the face c the 
sequence y^'z-1. Thus the fundamental group is the free abelian group of rank 3. 



7 Conclusion 

The described method gives the possibility to compute automatically a representation 
of a two- or three-dimensional manifold as a cell complex with the (almost) minimum 
number of cells. This evokes the hope to test by means of a fast computer the 
combinatorial homeomorphism of 3 -manifolds as the isomorphism of cell complexes. 
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The method also makes it possible to compute automatically a presentation of the 
fundamental group of the given manifold. The method may be useful for further 
investigations of 3 -manifolds and may be a contribution for the solution of the still 
unsolved problem of classifying 3 -manifolds. 

The tree-dimensional cell list developed here may be also used for economically 
encoding and analyzing tree-dimensional images, e.g. in computer tomography, or 
time sequences of two-dimensional images in digital television. 
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Abstract. The concept of “nearness” , which has been dealt with as soon 
as one started studying digital images, finds one of its rigorous forms in 
the notion of proximity space. It is this notion, together with “nearness 
preserving mappings”, that we investigate in this paper. We first review 
basic examples as they naturally occur in digital topologies, making also 
brief comparison studies with other concepts in digital geometry. After 
this we characterize proximally continuous mappings in metric spaces. 
Einally, we show by example that the “proximite complexity” of a finite 
covering in a digital picture may be too high to be adequately depicted in 
a finite topological space. This combinatorial result may indicate another 
conceptual advantage of proximities over topologies. 



1 Introduction 

Recently there have been quite an intense investigation of topological structures 
in image processing, mostly in connection with the analysis of connectivity and 
the operation of thinning (see e.g. [2,4,9,13], etc.). An interesting attempt to 
introduce richer structures than those of topology, and replacing thus “local” 
continuity properties by a global notion of nearness, has been done in [12] where 
the authors contemplated the so called semi-proximity spaces as a theoretical 
tool in the image processing studies. In this paper we want to go on in a similar 
vain by shedding light on certain questions which implicitly announced them- 
selves in the paper [12], and by complementing the results of [12] with some new 
findings. (Our investigation here is relatively technical. The reader is supposed 
to be acquainted with the motivation for investigating digital topology and thus 
for the selection of the problems we pursue. Reading the papers [8,12,18] would 
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MAT, by the project of Czech Ministry of Education No. VS96049, and by the Grant 
Agency of the Czech Republic under the grant GACR 102/00/1679. 
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certainly be instrumental as it was for the authors. We hope, however, that our 
exposition is reasonably self-contained.) 

The paper is organized as follows: In the section 2 the proximity spaces are 
introduced and related to the topological spaces, in particular to the topology 
introduced by Marcus and Wyse. Section 3 investigates the proximities in met- 
ric spaces. In the section 4 we discuss the problem whether nearness of finite 
partitions allows for a suitable finite topology on the index set. The conclusion 
summarizes the paper. 

2 Proximity Spaces — In Relation to Topologies 
and Metrics 

We shall deal with proximity spaces as implicitly defined by Riesz [16] and 
intensely pursued by several authors ([3,5,6,14]). Our definition is a modified 
version of the definition by E. Cech [3]. 

Definition 1 (Proximity Space). A pair (X, tt) is called a proximity space 
if X is a set and tt is a binary relation on the power set of X, expX, which 
is subject to the following four requirements (A, 5, C,D^E G expX, the symbol 
A nonn B means that An B is false): 

(i) {AU B) TT C A TT C or B n C , 

(a) An B ^ A ^ and 5 7 ^ 0, 

(Hi) AnBjtzil}^ An B, 

(iv) An B ^ B n A. 

If{X, n) fulfils the following additional requirement (v), it is called a T-proximity: 

(v) if {a}nonnB , then there is a set A such that {a}nonn (X — A) and 
{d}nonn B for each d G A. 

The axioms of a proximity space reflect the properties we observe when we 
consider the common-sense nearness. It should be noted that the less plausible 
axiom of T-proximity - the axiom (v) - guarantees that the proximity in question 
induces a topology (the proximities induce only the closure spaces - see [3] 
and [12]). There is an important link of proximity spaces and topological spaces. 



Theorem 1 (Proximity Space, Topological Space). 

(i) Let (X, 7 t) be a proximity (resp. T-proximity) space. Let A = {x G X\{x}nA}. 
Then (X, “) is a closure (resp. topological) space (the closure ~ is said to be 
induced by the proximity n). 

(a) Let (X, “) be a closure (resp. topological) space and let (X, “) fulfil the 
following condition (the Cech Rq condition): Ifa^b G X, then a G {b} if and 
only if b ^ {a}. Let A n B A\^ B t^0. Then (X, n) is a proximity (resp. 
T-proximity) space and, moreover, (X, tt) induces (X, “). k 
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The proof of the above result is slightly technical (though essentially easy - 
one would use Th.23.B3 and Th.25.B9 of Cech [3]) and we omit it. 

By the previous result, we can associate closures (resp. topologies) to prox- 
imities (resp. T-proximities) although not all topologies are proximizable. On 
the other hand, observe that many proximities on X may induce the same 
topology on X (and, vice versa, a topology may give rise to several proximities 
which induce it). For the reader’s intuition, let us note that e. g. the following 
two proximities pi,P 2 on an infinite set X define the same (discrete) topology: 
Ap 2 B<^ either An B 9 or both the sets A and B are 

infinite. 

It is possible to adopt the notion of proximity as primary and view the notion 
of topology as secondary. One of the reasons for doing it is that the topology 
describes only the local character of points in a picture (or, if we want, the prox- 
imity of points and sets). But if we are to treat geometrical qualities of a picture - 
a situation which typically arises in image processing - it is the proximity of sets 
which matters most. E.g. in the case of scanned text documents the characters 
are the connected components of the black pixels of the thresholded image. The 
pixel sets of adjacent characters of words are closer to each other than between 
the words. 

Thus, besides the concrete reason obvious from the study of thinning and 
shape deformation [12], the general reason for investigating the proximity rela- 
tion lies in its fundamental role in all kinds of geometrically oriented considera- 
tions. 

In view of Theorem 1 and the investigation of connectedness in digital im- 
ages [20], the following proximity in is worth recording. Recall that if [r , s] G 
then by the 4- neighbour hood of [r, s] we mean the set {[r, s],[r ± l,s], 
[r,s ± 1]}. 

Definition 2 (Marcus— Wyse proximity). Let denote the subset of 
consisting of all points with the integer coordinates. Let A G Z^ , B G Z^ . Let us 
write A 7T B if either An B ^ tj) or there exists a point p = [r, s] G AU B such 
that the following assertion holds true: 

(i) r s is an odd number, 

(a) if p n A, then the 4-neighbourhood of p intersects B, 

(Hi) if p G B, then the 4-neighbourhood of p intersects A. 

The result of [20] (see also [11]) can be now reformulated proximity- wise. 
Following [12], let us agree to say that a set 5 in a proximity space (P, tt) is 
connected if S cannot be written in the way *5^ = AU B, where A,B are two 
non-proximal sets (i.e. A noniv B). 

Theorem 2. The graph-theoretic connectedness in Z^ induced by the 4-neigh- 
bourhood adjacency relation coincides with the proximite connectedness induced 
by the Marcus- Wyse proximity. 

Let us take up proximities on metric spaces (here all proximities will auto- 
matically be T-proximities). Let us consider a metric space, (M, p).Then (M, p) 



72 



Pavel Ptak and Walter G. Kropatsch 



can be viewed as a topological space with the closure A = {x G M \ p{x, A) = 0}. 
In this way the metric space (M,p) induces a “topological” proximity, de- 
fined as follows: AirfB^AnB^^. But (M, p) also induces a metric prox- 
imity, 7Vm- Write, for two subsets A and 5, p(A,B) = inf p(a,b) and set 

aeA,beB 

A TTm B p{Aj B) = 0. It is easily seen that if A 7Tt 5, then A Tim B but not 
necessarily the other way round. If, for instance, A is the graph in of the 
function f{x) = ^ and B is the x-axis in then A Tim B with respect to the 
Euclidean metric, but not A iit B. 

For a reader not trained in topology, let us explicitly clarify why (and when) 
the latter phenomenon (of the difference of tt^ and tt^) may occur. Recall that 
a metric space M is called compact ([3,5], etc.) if each sequence in M allows 
for a convergent subsequence in M. As known, a subset P of endowed with 
the Euclidean metric (taken form R'^) is compact if and only if it is closed and 
bounded. 

Theorem 3. Let (M,p) be a compact metric space. Then nt = i.e. in com- 
pact metric spaces the topological proximity agrees with the metric proximity. 

Proof. Suppose that A,B d M. It only remains to show that if A Tim B^ then 
A TTt B^ the other implication is always valid. Let A iTm B. We are to show 
that A n B 0. The relation A Tim B means p(A, B) = 0. It follows that for 
each n ^ N we can find points an ^ A^ bn ^ B such that p{an^bn) < 

By compactness, there is a subsequence, of {on} which converges to some 
element a G M, and there is a subsequence, bn^ of {bnj,} which converges to some 
b G M. Obviously, a = b and therefore the sequences {an A i^rn} converge 

to a common element a(= b). Since a G A and b ^ B and since a = 6, we see 
that An B A This completes the proof. 

The morphisms in the category of proximity spaces are the proximally con- 
tinuous mappings. 

Definition 3 (proximally continuous mapping). Let (Xi,7Ti), (X2,7T2) be 
proximity spaces. A mapping f \ X\ ^ X 2 is called proximally continuous if the 
following implication is true: 

If A^B c Xi and Atti B, then f{A) 712 f{B). 

Thus, proximally continuous mappings are those mappings which preserve 
proximity. It is easily seen that a proximally continuous mapping is automati- 
cally continuous when understood as a mapping between the respective topolog- 
ical spaces induced by proximities. A continuous mapping does not have to be 
proximally continuous even if we consider it with respect to the metric proximity. 



Example: Consider the function f{x) = x^ : (0, + 00 ) ^ (0, + 00 ). This mapping 
is obviously continuous but not proximally continuous. Indeed, let pn be such a 
sequence that {pn + |)^ — — 1* Let A = {pn \ n G N} and B = {pn + ^ I ^ ^ 

N}. Then A iTm B but f{A)non7Tm f{B). 
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3 Metric Proximities 

In this section we are going to prove that the metric proximity of a space deter- 
mines, up to a metric equivalence, the metric of the space. This relatively deep 
result has found applications in a number of geometric problems (see e. g. [5] for 
relevant comments). This result can be expressed in terms of “small” (=count- 
able) subsets of the metric space in question and therefore it may have bearing 
on digital geometry. Also, we see that metric considerations of digital pictures 
([17,18,19]) have a proximity character (i.e. can be expressed in proximity terms). 

Let us take up the proof of the result. We provide a simplified transparent 
proof based on elementary reasonings only. We also point out other features of 
metric spaces relevant to proximity. Recall first two standard definitions. 

Definition 4. Let (M, p) he a metrie spaee and let {an} he a sequenee in M . We 
say that {an} is a Cauchy subsequence if for eaeh e > 0 there is uq ^ N sueh 
that p{xm^Xn) < e provided n > no,m > no- We say that a sequenee{xn)neN 
in M is metrically discrete (of order e) if for any n,m e N we have p{xm,Xn) 
> e. 

The following proposition is essential in our argument. It may be interesting 
in its own right. Before we formulate it, let us agree to call the set Br{a) = {6 G 
M I p(a, h) < r} the r-ball around a. 

Proposition 1 (Sequence principle in metric spaces). Eaeh sequenee in 
a metrie spaee eontains either a Cauehy sequenee or a metrieally diserete suh- 
sequenee. 

Proof Take a sequence in a metric space and form the collection of all 1 -balls 
centered at each of its points. If each of these balls contains only finitely many 
points of the sequence, we can easily construct a subsequence of the given se- 
quence which is metrically discrete of order 1. It not, there is a 1-ball around 
a point of the sequence which contains infinitely many points of the sequence. 
Take these points and form the collection of all —balls centered at these points. 
If each of these | -balls contains only finitely many points, we can easily con- 
struct a metrically discrete subsequence of order If not, there is a point such 
that the ^-ball around it contains infinitely many points. Going on this way in- 
ductively, either the procedure stops at the n-th step and we have an ^-discrete 
subsequence, or we obviously obtain a Cauchy subsequence of the given sequence. 

We shall need one more metric notion. Let us recall it together with proximal 
continuity in metric space. 

Definition 5. Let {Mi, pi), (M 2 ,p 2 ) bo metrie spaees. Let f: Mi M 2 he a 
mapping. In aeeord with our general definition, we say that f is proximally 
continuous if the following property is fulfilled: 

If P and Q are subsets of Mi sueh that pi{P, Q) = 0, then p 2 {f{P), /(Q)) = 0. 

We say that f is metrically continuous if for any e > 0 there exists > 0 
sueh that whenever pi{x,y) < S, then P2{f{x),f{y)) < s (sueh a mapping f is 
sometimes ealled uniformly eontinuous) . 
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Let us now formulate and prove the main result of this section. In the effort to 
make the proof accessible for nonspecialists in topology, we use only elementary 
reasonings. The novelty seems to be the utilization of the sequence principle as 
established in Prop. 1. 

Theorem 4. A mapping between metrie spaees is proximally eontinuous if and 
only if it is metrieally eontinuous. 

A eonsequenee: If two metries on a set induee the same proximity, they have 
to he (metrieally) equivalent. 

Proof. A metrically continuous mapping between metric spaces is obviously 
proximally continuous. Let us take up the nontrivial implication of the theo- 
rem. 

Let /: Ml ^ M 2 be proximally continuous. Suppose / is not metrically 
continuous. It means that for some e > 0 there exist sequences {an)neN and 
{bn)neN in Mi such that pi{an,bn) 0 whereas P2(/(«n), /(^n)) > Let 

us look at the sequence f{an)neN- If h contains a Cauchy subsequence, then 
there is an infinite subset of this sequence which is all contained in an |-ball. 
Let us denote this subsequence by f{ank)keN‘ Then the sets {/(an^): k G N} 
and {/(^n/e)- k G N} are obviously |-apart (the triangular inequality), which is 
absurd. If there is a Cauchy subsequence of f{bn)neN^ we can apply an analogous 
reasoning. Suppose that neither of the former two cases applies. By the sequence 
principle, we can easily construct metrically discrete subsequences f{ank)keN 
and fibnjfjkeN of f{an)neN and f{bn)neN, respectively. For simplicity, let us 
denote them again by f{an)neN and f{bn)neN- 

Let us assume that these sequences are metrically discrete of order a. Define 
r = \ min{e, a} and form the collection of all the r-balls centered at f{an)neN‘ 
Note that each of them contains at most one of the elements f{bn)neN‘ Let us 
now proceed inductively. Put ni = 1. Take U2 G A^, 712 > 1 such that 
U 2 > h if f{bh) belongs to the r-ball centered at /(ui), and 
U 2 > k if the r-ball centered at f{ak) contains f{bi), 

(if neither h or k exists, we simply take U 2 = ni + 1). By induction, given ni < 
n 2 < ' - ' < Ui, take G N, such that 

> h if f{bh) belongs to the r-ball centered at /(a^J, and 

> k if the r-ball centered at f{ak) contains /(^^J. 

Note that, by our construction, if j G {uk\k G N}, then f{bj) does not belong 
to any of the r-balls centered in /(ttn^). The sets f{anf,)keN and f{bnf,)keN 
are therefore apart of the order r. Write A = {un/el/cew, B = {bnf,}keN- The 
pi{A,B) = 0 but pi{f{A),f{B)) > r. This means that / is not proximally 
continuous - a contradiction. This completes the proof. 

The main consequence of the latter theorem (and its proof) as far as the 
potential application in image processing goes is contained in the next theorem. 
It does not seem to be explicitly formulated in the literature but it is this for- 
mulation which may have relevant bearing on the digital images studies - it 
reduces proximities of general sets to proximities of small (=countable, digitally 
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accessible) sets. (Recall that a set is said to be countable if it has the smallest 
possible infinite cardinality, i.e. if it has the cardinality of natural numbers.) 

Theorem 5. Let pi,p 2 be two metries on a set M. If = 0 p 2 {A,B) 

= 0 for all countable subsets of M , then the metries pi, P 2 ciTe metrieally equiv- 
alent. In other words, if the proximities given by p\ and p 2 agree when restrieted 
to eountable subsets of M, then p\ and p 2 are metrieally indistinguishable. 

4 Near and Far Sets in a Finite Partition — Could 
‘^nearness” Be Controlled by a Finite Topology? 

Each picture can be viewed as a partition of the underlying set into a finite 
family of sets. The main point in understanding the partition is specifying which 
sets are “near” (proximal) and which are “far” (non-proximal) . In this section 
we exhibit an example which shows that the proximal relation of sets in a finite 
partition may be too complex to be described with the help of the notion of finite 
topology. This relates in a natural way our investigation here with topological 
studies in image processing (see [13,8,11,18,10,20], etc.). (Recall that by a finite 
partition of a set S we mean a mutually disjoint collection {Sk {k = 1 , 2 , . . . , n)} 

n 

of non-empty subsets of S such that aS'/c = *5^.) 

/c=l 

Let us now introduce an auxiliary notion. 

Definition 6 . Let (X, tt) be the metrie proximity spaee of the metrie spaee 
(M, p). Let P = {Sk I k = 1, 2, . . . , n} be a partition of X. Let us say that P 
is controlled by topology if there is a topology t on the setT = { 1 , 2 , . . . , n} 
sueh that Sk tt Si if and only if the set {k,£} is topologieally eonneeted as a 
topologieal subspaee of T . 

If each partition could be topologically controlled, which seems conceivable 
at first sight, we would find ourselves in an advantages situation in view of the 
understanding of finite topologies (see e.g. [18] and [10]). However, it is not 
necessarily the case. We will illustrate it by a simple example. Before, observe 
that we can restrict ourselves to open sets of the partition if we allow for a small 
degree of overlapping ( in practice, we have to allow for it anyhow in view of the 
imperfection of our measurement). 

Proposition 2 . Let (M,p) be a metrie spaee. Let {Sk \ k = l,2,...,n} be a 
partition of M . Then there is an e, e > 0 sueh that the following statement holds 
true: If Tk = {x ^ M \ p{x, Sk) < k = 1, 2, . . . , n, then eaeh Tk is an open 
subset of M , and T/^ H 7 ^ 0 if and only if Sk Tip Si. 



Proof. All sets Tk, /c = 1, . . . ,n are obviously open. Further, consider all cou- 
ples Sk,Si such that Sk non it p Si. Since there are only finitely many couples 
with this property, there is a sufficiently small 5 , e > 0 such that T/^ H = 0 
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Fig. 2. 



for all couples of T/^, Ti corresponding to the couples Sk^ Si considered. Since we 
have Sk Tim Si for the remaining couples, we infer from the definition of met- 
ric proximity that the neighbourhood sets T/c,T^, corresponding to these Skj Si 
fulfil Tk nTi ^ 0. The proof is complete. 

We see from the above proposition that we can simply construct the coun- 
terexample on the disjointness - intersection basis of an open covering. 

Example: Consider the Fig. 1 above. Let {Bi\i < 6} be the family of balls 
in as drawn in Fig. 1. Then the proximity of the family {Bi\i < 6} cannot 
be topologically controlled. 

Indeed, considering the couples of balls which are disjoint (resp. which over- 
lap), we see that the existence of the required topology would amount to having 
a topology on the set {1,2, 3, 4, 5, 6} such that precisely the following pairs would 
constitute connected subspaces: (1, 2}, {2, 3}, {2, 5}, (3, 4}, (3, 5}, and {5,6}. 
This would mean the existence of topology on the Fig. 2 above which would 
produce the 8-adjacency connectedness but this is impossible as shown in [13] 
and [11]. 



5 Conclusions 

We suggest that the notion of proximity space might be a useful tool for theoreti- 
cal studies in image processing. We exhibit basic examples and we link them with 
previous topological investigations (for instance, with the Marcus-Wyse topolo- 
gies). Then we analyze more thoroughly the metric proximities. As a main result 
we show that the metric proximity of small sets determines in a way the metric 
of the underlying space. The interpretation of this result reads, roughly, that 
if we can verify which sequences of points in a metric space are proximal and 
which are not, we can in a certain sense recover the metric. In the end we show 
that a topological result known from the investigation of 8-adjacency relation 
in disproves a conjecture about “topologizing” proximities in a partition. 
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Abstract. The morphological approach to image processing consists in 
probing the image structures with a pattern of known shape called struc- 
turing element. In this paper, we concentrate on structuring elements in 
the form of discrete line segments, including periodic lines. We inves- 
tigate fast algorithms, decomposition/cascade schemes, and translation 
invariance issues. Several application examples are provided. 



1 Introduction 

In mathematical morphology [15,20,21,9,23], image structures are extracted or 
filtered out by letting them interact with a pattern of known shape called struc- 
turing element. While discrete approximations of disk structuring elements are 
desirable in many applications, discrete line segments of a given length and ori- 
entation are best suited for processing thin structures such as roads in satellite 
images or ridges and valleys in fingerprints. In addition, cascades of morpho- 
logical operators with line segments can be used for producing discrete appro- 
ximations of disks. The goal of this paper is to study morphological operators 
based on discrete line segments. 

The paper is organised as follows. Issues related to the definition of discrete 
line segments are detailed in Sec. 2. Recursive van Herk’s algorithm for erosions 
and dilations is recalled in Sec. 3 together with its extension to arbitrary di- 
rections, grey scale structuring elements, and 3-dimensional images. Section 4 
concentrates on cascades of morphological operators with line segments. Parallel 
combinations of a bank of openings and closings with line segments for vary- 
ing orientations are investigated in Sec. 5. Before concluding, we show in Sec. 6 
that binary and grey scale convex hulls can be obtained by performing max 
computations along a series of periodic lines. 

2 Discrete Line Segments 

2.1 Connected Line Segments 

Rosenfeld [19] showed that a digital arc L is a digitization of a straight line seg- 
ment if and only if it satisfies the so-called chord property^ i.e., the line segment 
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joining any two points of L lies everywhere within a distance 1 of 5. Note that 
the Kim’s area property [13] and chord property are equivalent as pointed out 
later by the same author [14]. 

Bresenham [2] has proposed an attractive algorithm for generating discrete 
lines satisfying the chord property (see [4] for further improvements). This al- 
gorithm is especially attractive on systems not equipped for fast floating point 
operations because it requires only integer arithmetic operations. The pixel se- 
lection criterion is based on a distance criterion. For instance, Fig. 1 shows a 
continuous line segment of slope 3/5 over a square grid of pixels. The pixels 
belonging to the discrete line approximating this continuous line segment are 
determined using distance measurements along each vertical line linking the 
centres of the pixels of the grid. These vertical lines are represented by vertical 
dotted lines in Fig. 1. For each vertical line, the pixel whose centre is closest to 
the continuous line belongs to the corresponding discrete line. When the slope is 
larger than 1 , the horizontal dotted lines linking the centres of the pixels must 
be considered instead of the vertical ones. 




Fig. 1. Euclidean line of slope 3/5 and corresponding Bresenham discrete line: 
each time the Euclidean line cuts the vertical lines linking the centres of the 
pixels (i.e., the vertical dotted lines), the centre of the pixel which is closest to 
the Euclidean line defines a pixel of the discrete line 



Note that for a Bresenham line of slope in the form of an irreducible fraction 
y/x, the number of distinct line segments occurring along the discrete line equals 
max(|x|, 1^1). It is convenient to include the forms 0/1 and 1/0 for referring to 
horizontal and vertical lines respectively. The number of distinct line segments 
corresponds to the periodicity k of the elementary pattern occurring along the 
Bresenham line, i.e., k = max(|x|, \y\). In the sequel, we denote by Lxi^(x,y) fhe 
connected line segment obtained by considering A successive pixels of a Bresen- 
ham line of slope y/x^ starting from the ith pixel of the line {i G {1,... ,k}). Eor 
example, Eig. 2 shows the five line segments of length equal to 11 pixels occur- 
ring for a slope of 3/5: ^ 112 ^ 5 ^ 3 ), and 
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^lll,(5,3) ^112,(5,3) ^113,(5,3) ^114,(5,3) ^115,(3,5) 



Fig. 2. The five possible line segments occurring along a Bresenham line of slope 
3/5 (here for a length of 11 pixels). Note that all these line segments satisfy the 
chord property 



2.2 Periodic Lines 

Digital connected line segments at arbitrary orientation are broad approxima- 
tions of Euclidean line segments. This led Jones and Soille [11,12] to introduce 
the concept of periodic lines by considering only those points of the Euclidean 
line that fall exactly on grid points. 

In mathematical terms, a periodic line is defined as follows: 

2 = A-1 

P\,v = IJ iv, (1) 

2 = 0 

where A > 1 is the number of points in the periodic line and t; is a constant 
vector^. The vector v is in the form v = (x, where x^y G Z. Similarly 
to Bresenham lines, we define the periodicity k of the periodic line as follows: 
k = max(|x|, |^|). Eor example, Eigs. 3a, b, and c correspond to the periodic 
lines P 3 ,(i,o), ^ 3 , ( 1 , 1 ) and ^ 3 ,( 2 ,!) respectively. 



(a) (b) 

Fig. 3. Examples of periodic lines, (a) P 3 ,(i,o)- (b) P 3 ,(i,i)- (c) T 3 ,( 2 ,i)- Points 
belonging to a line are indicated by squares. The position of the origin of each 
line is indicated by a black square 




(C) 



An interesting property of a Bresenham line of slope in the form of an irre- 
ducible fraction y/x is that it is exactly covered by max(|x|, |^|) Euclidean lines 

^ Note that periodic lines were originally defined [12] as follows: U-Gq iv. Here, we 
have adapted the definition of periodic lines so that A equals to the number of pixels 
of the periodic line (rather than A H- 1 in the original definition), in accordance with 
the parameter A used for Bresenham connected line segments L. 
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of slope yjx. This property is illustrated in Fig. 4, for a line whose slope equals 
2/3. Indeed, due to the construction of a Bresenham line, the periodicity of the 
pixels in a line of slope yjx equals max(|x|, |^|) and all these pixels belong to a 
Euclidean line of slope yjx. In other words, any Bresenham lines of slope is 
covered by max(|x|, |^|) periodic lines of the same slope. This property is at the 
basis of the translation invariant implementation of half-plane closings described 
in Sec. 6. 



discrete Bresenham line 
of slope -2/3 



the three Euclidean lines of slope -2/3 
— exactly matching the pixels of 
the Bresenham line 



(a) 




Fig. 4. (a) A Bresenham line segment of slope —2/3. (b) The three Euclidean 
line segments matching all points of (a). Note that the intersection between each 
Euclidean line and the Bresenham line defines a periodic line Pz,{z- 2 ) 



2.3 Angular Resolution 

In a discrete grid, the angular resolution of a discrete line segment depends on its 
length. In a square grid, only 2n — 2 directions can be defined with a connected 
line segment of odd length equal to n pixels, and whose middle and extreme 
pixels are matched by the Euclidean line of the same orientation. By relaxing 
the condition regarding the middle pixel, there are 2(2n — 2) possible orientations 
for any n > 2. Eor example, Eig. 5 illustrates the 8 (reps. 16) possible orientations 
for a connected line segments containing 5 pixels. 




(a) (b) 



Fig. 5. (a) The 8 possible orientations for a line segment of 5 pixels and whose 
middle and extreme pixels are matched by the Euclidean line of the same ori- 
entation. (b) The 16 possible orientations by relaxing the condition concerning 
the middle pixel 



If we merely impose to the line segments to contain at least two pixels exactly 
matching the corresponding Euclidean line, the number of distinct orientations 
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equals 4card(F^) — 4, where Fi is the Farey sequence [3] of order z, and card(-) 
returns the number of elements (i.e., cardinal number). The Farey sequence Fi of 
order i > 1 is the ascending sequence of all fractions p/q for which 0<p/q< 1, 
q < ij and p and q are nonnegative integers with no common divisors other than 
1 (note that the form 0/1 is included in the sequence). A graphical representation 
of all possible discrete slopes in the range [0, 1] for a line segment whose length 
is less than or equal to 32 pixels is shown in Fig. 6. 




Fig. 6. Graphical representation of all possible discrete slopes in the range [0, 1] 
and for a length less than or equal to 32 pixels. Each slope is obtained by linking 
the origin (lower left pixel) to a given dark pixel 



3 Recursive Algorithm for Erosions and Dilations 

3.1 Principle [34,8] 

The recursive algorithm is due to van Herk [34] and, almost simultaneously, Gil & 
Werman [8] (see also [6]). We follow here the notations and description proposed 
in [23, pp. 77-78]. A 1-dimensional input image / of length nx is divided into 
blocks of size A, where A is the length of the line segment in number of pixels. 
The elements of / are indexed by indices running from 0 to nx — 1. It is also 
assumed that nx is a multiple of A. Two temporary buffers g and h of length 
nx are also required. In the case of dilation, the maximum is taken recursively 
inside the blocks in both the right and left directions (right for g and left for h). 
When both g and h have been constructed, the result for the dilation r at any 
coordinate x is given by considering the maximum value between g at position 
X -h A — o — 1 and h at position x — o denoting the coordinate of the origin 
of the structuring element (e.g., 0 for the first pixel of the line segment). This 
recursive dilation algorithm can be written as follows: 
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ifx = 0,A,...,(m-l)A, 

^ max[^(z — 1), /(z)], otherwise. 

^ f /(^)> if a; = mA - 1, (m - 1)A - 1, . . . , A - 1, 

^ [ max[/i(a: + 1), /(x)], otherwise. 

r(x) = max[^(x + A — o — 1), h(x + o)]. 

In [7], minor improvements to the recursive procedure are proposed. This is 
achieved by reducing the number of computations in the generation the forward 
and backward buffers and the merge procedure. Other enhancements for the 
simultaneous computation of min and max filters and openings / closings are also 
developed. When dealing with binary images, a recursive algorithm based on 
directional distance transforms also running in constant time is described in [17]. 



3.2 Extension to Arbitrary Directions and Periodic Lines 

Principle [26,27] The recursive procedure is directly applied to the image 
pixels falling along a line at given angle. The line is then translated and the 
whole procedure is repeated until all image pixels have been processed, i.e., until 
the translations of the line have swept the whole image plane. The direction of 
the translation depends on the slope of the line (see Fig. 7). By doing so, each 




Fig. 7. Depending on its slope, the discrete line is drawn from an appropriate 
image corner. It is then translated in a unique direction in order to sweep the 
whole image plane while avoiding overlapping pixels (the arrows indicate the 
translation direction) 



pixel is processed only once. When translating the line from its original position, 
the number of pixels falling within the image plane first increases, then remains 
constant, and finally decreases. If the image size along the translation direction is 
shorter than the distance between the extremity of the line falling off the image 
plane and the image plane, the constant zone is replaced by a zone where the 
line is increasing at one end while decreasing at the other. The number of pixels 
falling within the image after each translation can be efficiently updated using 
a run length coding of the discrete line. The algorithm is suited to line erosions 
and dilations in 3-dimensional images but requires a careful analysis of border 
effects. 
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The procedure extends directly to periodic lines. Once the periodic struc- 
turing element has been defined, the discrete Bresenham line having the same 
orientation is traced from one of the image corners. Since the connected periodic 
line has the same orientation as the Bresenham line, the latter can be translated 
along the line so as to process all pixels of the line falling within the image 
plane. As described in the previous paragraph, the line is then translated in an 
appropriate direction and the procedure is repeated until the whole image plane 
has been swept. Figure 8 shows an example with the periodic line T 3 ,( 2 ,i)- 

Translation Invariance Issues As already noticed in [27], the shape of the 
connected line segment varies slightly from one pixel to another for all orienta- 
tions not matching one of the principal directions of the digitisation grid. These 
variations are studied in detail in [31] while proposing solutions for achieving 
translation invariance wherever necessary. In general, there are k possible out- 
puts for a neighbourhood image operator iF by a line segment of length A pixels 
applied along a Bresenham line of slope y/x and sweeping the whole image 
definition domain. We denote them by ^BLx.,{x,y) where i G {1,... ,/c} and 
k = max(|x|, |^|). 

The recursive translation invariant strategy relies on the following structuring 
element decomposition: 



^nki,{x,y) ^ki,{x,y) ® Pn,{x,y)-! 


(2) 


where © denotes the Minkowski addition [16] and i G {1, . . 
following relationships hold 


. , /c}. Therefore, the 


^^nk^,(x,y) ^ ,(x ,y) ^ ^n,(x ,y) "> 


(3) 


^^nk^,{x,y) ^ ^k^,(x,y)^ ^n,(x,y) ’ 


(4) 


We also show in [31] that the union (i.e., point- wise maximum V for discrete 
grey scale images) of all k possible non-TI openings is identical to the union of 
the openings by the k possible line segments (the same result holds for closings 
(p by replacing the union with the point- wise minimum A): 


i=k i=k 

V lBLx-,{x,y) = \f lLx.,{x,y)i 
i=l i=l 


(5) 


i=k i=k 

A = A ^Lxi,(x,y)' 

i=l i=l 


(6) 



3.3 Extension to Grey Scale Structuring Elements 

The dilation of an image / with a grey scale (also referred to as volumic or 
non- flat) structuring element By is denoted by Ss^if) and is defined as follows 
for each point x: 

= m^{f{x + b) + By{b)}. 



(7) 
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(a) 



PIXELS IN THIS REGION HAVE ALREADY BEEN PROCESSED 



Direction of translation 
of the connected periodic line 



^ PIXELS IN THIS REGION HAVE NOT YET BEEN PROCESSED 

- PIXELS ALONG THIS BRESENHAM LINE ARE BEING PROCESSED (see (c)) 
(b) 




Result(8)=max[h(8) , g(12) ] 

(c) 

Fig. 8. Implementation of dilation with a periodic structuring element using 
a modified van Herk algorithm, (a) A periodic structuring element ^ 3 ,( 2 ,!), its 
origin being the black pixel, (b) The image plane after 9 translations of the 
corresponding Bresenham line, (c) Schematic of the algorithm: the value ing or h 
at the starting point of a cycle of arrowed arcs equals the original value in the 
input image / (e.g., g(Q) = f{6) or h{ll) = /(ll)). The value at the pixel pointed 
by an arrowed arc equals the maximum between the value of / at this position 
and the value at the beginning of the arrowed arc (e.g., h{2) = max[/(2), h{4:)]). 
This algorithm requires 3 max comparisons, whatever the number of pixels of 
the periodic line 
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The grey scale weights of a grey scale structuring element should be set according 
to the image intensity values. The erosion is defined by duality with respect to 
set complementation: [£By{f)]{x) = b) — By{b)}. 

Grey scale structuring elements should be used with care because the cor- 
responding erosions and dilations do not commute with scalings of the pixel 
intensity values [18,29]. Nevertheless, these structuring elements are useful for 
some applications such as the rolling-ball algorithm and the computation of 
shadows of an image seen as a topographic surface [32]. 

Let us show that the recursive procedure for computing min/max filters (see 
Sec. 3.2) extends to grey scale periodic structuring elements, i.e., structuring 
elements whose domain of definition is a periodic line and whose grey scale 
values are defined as the index of the point i in the periodic line multiplied by a 
real number s defining the grey scale slope: P{iv) = Vz G {0, 1, . . . , A— 1} [28]. 
We denote by P\^v,s grey scale periodic structuring elements of grey scale slope 
s. Examples of grey scale periodic structuring elements are presented in Fig. 9. 



0 











(a) (b) (c) 



Fig. 9. Grey scale periodic structuring elements, (a) T 3 ,( 3 ,i),-i. (b) T 3 ,(i,i), 2 - 
(c) ^3, (2,-1), 4 



The algorithm requires an additional buffer /'. The values of the input im- 
age are copied in this buffer which is partitioned into blocks as described in 
Section 3.2. The periodic structuring element is then positioned at the first pixel 
of each block and the weights are added to each pixel of the block whose intersec- 
tion with the structuring element is non-empty. The structuring element is then 
translated by one pixel to the right and the procedure is repeated until a weight 
has been added to all pixels of f' (there are k — 1 translations per block) . Once 
the values of the buffer f' have been calculated, the buffers g and h are com- 
puted from the buffer f' using the recursive procedure detailed in Section 3.2. 
Finally, the resulting value at each position equals the maximum value between 
the value in the buffer h at the current position and the value in the buffer g at 
the current position plus x + k{\ — 1). However, in order to ensure that the same 
weights are used for all positions, appropriate multiples of the slope s must be 
added to g and subtracted from h beforehand. More precisely, x mod A times s 
must be removed from h and (A — 1) — (x-hA — 1) mod A times s must be added 
to g. For clarity and conciseness, the case of a connected periodic line of A pixels 
(i.e., the periodicity k is equal to 1) with the origin matching the first pixel of 
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the periodic line (i.e., o = 0) is presented hereafter: 

f'{x) = f{x) + {x mod A)s 

. X _ f f'{x) if X mod A = 0, 

\ max.[g{x — 1), f'{x)] otherwise. 

. X _ f f'{x) if (x + 1) mod A = 0, 

[ max[/i(x + 1), f'{x)] otherwise. 

r{x) = max[^(x + A — 1) + ((A — 1) — (x + A — 1) mod A)s, h{x) — {x mod A)s]. 

Note that all divisions are integer divisions. This algorithm adapts directly to 
non-unitary periodicities (i.e., A; > 1). An example is given in Fig. 10 for the 
periodic line T 3 ,( 2 ,o),s- For conciseness, we use a structuring element of 3 pixels 
only. Since our algorithm requires 3 max comparisons per pixel whatever the 
number of pixels in the structuring element, speed gains are obtained for larger 
structuring elements. 



0 




s 




2s 



(a) 




Result(8)=max[h(8)-s, g(12)+2s] 

(b) 

Fig. 10. Recursive min/max filter with a grey scale periodic structuring ele- 
ment T 3 ,( 2 ,o),s (shaded origin). The structuring element is shown in (a) and the 
schematic of the algorithm in (b) 
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3.4 Extension to 3-Dimensional Images 

This extension is trivial in the sense that the very nature of the algorithm is 
l-dimensional. In practice however, the implementation of a sweeping procedure 
allowing for the processing of the image volume along discrete lines while avoid- 
ing processing a voxel more than once requires a very careful analysis of the 
translation directions and handling of the image borders. The original position 
(image corner) of the line and the two translations to consider depend on the 
orientation of the line (there are 16 cases). 

4 Line Segment Cascades 

By cascading two erosions (resp. dilations) with vertical and horizontal line seg- 
ments, one achieves erosions (resp. dilations) with square structuring elements: 

where is a square a width n pixels. It has long been know that discrete 
diamond-shaped structuring elements cannot be generated by cascading erosions 
(resp. dilations) with structuring elements at 45 and —45 degrees. Although 
logarithmic decompositions have been proposed [33] for speeding up operations 
with diamond-shaped sets, the following simple and efficient decomposition can 
be used instead: 



On — © O 2 , (9) 

where is the diamond-shaped structuring element with a side of n pixels (n > 
2), i.e., 02 is the 4-connected neighbourhood plus its central pixel. By definition, 
Oi is a single pixel. An example is shown in Fig. 11. All these decompositions 




y 







Fig. 11. Decomposition of a diamond-shaped structuring element of a width of 
4 pixels using Eq. 9 



substantially speed up operations for n large enough (6 (resp. 10) min/max 
comparisons per pixel whatever the width n of the square (resp. diamond) when 
using the recursive algorithm instead of a number in 0(n^) for the brute force 
algorithm). Note also that cascades of erosions/dilations by 02 currently used 
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for generating operations with diamond- shaped structuring elements of larger 
size are not only less efficient than Eq. 9 but also only lead to diamonds of odd 
width. In the sequel of this section, we concentrate on cascades starting from 
periodic lines and leading to connected lines and discrete disks. 

4.1 Leading to Connected Lines 

Cascades of periodic lines can be used for generating connected line segments and 
granulometric families of disks. This idea has originally been developed in [12]. 
We summarise here the main results. It is apparent from the example shown 
in Fig. 3c that in general a periodic line is not connected. In fact, in the two- 
dimensional case using 8-connectivity, the only examples of connected periodic 
lines are when both the horizontal and vertical components of the periodicity are 
either —1, 0, or 1. However, periodic lines can be cascaded with other periodic 
lines or carefully chosen structuring elements to form a connected line (this 
principle it at the very basis of Eq. 2). Indeed, if Pa,v is a periodic line, where 
A > 1, and A is any connected line with end points given by 0 and v, then L\ = 
A © Pa,v is a connected line with the same end points as Pa,v An example is 
shown in Fig. 12 using the periodic line ^ 3 ,( 2 ,!) from Fig. 3c. In Fig. 12a are 



Fig. 12. Connected periodic lines, (a) Examples of the connected line A for the 
periodicity (2, 1). (b) Corresponding connected periodic lines L 2 = A (B ^ 2 , ( 2 , 1 ) 

examples of the line A that have the two end points (0, 0) and (2,1). Note that 
the line A may be of any type so long as it is connected; the first two we show 
here are 8-connected Bresenham lines and the third is 4-connected. Although it 
would be advantageous if the connected line A was also periodic, as this would 
admit a fast and translation-invariant implementation of A, this is only possible 
for a restricted class of periodicities. The corresponding connected periodic lines, 
given by the cascade A © ^ 2 ,( 2 ,!)? are shown below each line A in Fig. 12b. 

The granulometric properties of periodic shapes are summarised by the fol- 
lowing theorem [12]: Given any set A and periodic shape 5 a, v, the opening 
-Tx(/) = admits < Fx, where A, ^ > 0. It is therefore a 

granulometric function with size vector A. 

A particular application is to linear granulometries. Since the line Pa,v is an 
example of a periodic shape 5 a, v, we may state the following: If P\(/) = 7 La (/)? 





(a) 




(b) 
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where L\ is a connected periodic line, then Fx is a granulometric function with 
size parameter A > 1. Figure 13 shows an example using the periodic line i^ 3 ,( 2 ,i) 
from Figs. 3 and 12. Figure 13a is a connected line A with end points (0,0) 
and (2,1). Figures 13a, b and c respectively are the connected periodic lines 
^ ^ © ^ 2 , ( 2 , 1 ) and A ® ^ 3 , ( 2 , 1 ) which are used to generate the first 

three members of the family of the granulometric function Ia(/) = 7La(/)- Note 



(a) (b) (c) <d) 

Fig. 13. Granulometric functions can be generated from a connected periodic 
line, (a) Connected line A. (b), (c) and (d) Growth of the connected periodic 
line 



that the increase in length of L\ equals k pixels, the periodicity of the underlying 
periodic line. By considering the union of openings described in Sec. 3.2, one can 
however increase the length of the structuring element one pixel at a time while 
satisfying all axioms of a granulometry. The resulting directional granulometry 
r for a given slope y/x and size parameter A is then defined as follows: 

i=k 

r\,{x,y) = \J (10) 



4.2 Leading to Discrete Disks 

In Euclidean morphology, Matheron [15, p. 94] has shown that for ^i, ^ 2 , • • • On 
distinct in [0,7t), /ci, /c 2 , . . . ^kn > 0, then kiLo^ (Bk2Le^ ©• • -(BknLe^ is a convex 
polygon of 2n sides whose opposite edges are of length 2ki and have orientation 
given by 6i. Adams [1] used this principle for generating disks (and spheres) of 
increasing size from cascades of dilations by discrete Bresenham line segments. 
In [12], it has been show that cascades of periodic lines lead to better results 
in the sense that the resulting disks are symmetric. The approximation of a 
Euclidean disk (ball) B can be written as follows: B ^ However, no clue 

was given on how to select the vector of sizes and the corresponding vector of 
periodic lines. This can be achieved as follows. Suppose we look for the best 
discrete approximation of an Euclidean disk using cascades of line segments 
whose Euclidean length equals 1. We then consider Eig. 6 and look for all slopes 
defined for this length and use the corresponding periodic lines with the maximal 
number of pixels so that the corresponding Euclidean line segment does not 
exceed 1. We denote by Bi the corresponding approximation. Eor example we 
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have the following cascades for the five first approximations: 

= -^2, (1,0) © ^2, (0,1): 

~ -^ 2 , ( 1 , 0 ) © ^ 2 , ( 0 , 1 ) © ^ 2 , ( 1 , 1 ) © ^ 2 , ( 1 ,- 1 ): 

B 2 = ^ 3 , ( 1 , 0 ) © ^ 3 , ( 0 , 1 ) © ^ 2 , ( 1 , 1 ) © ^ 2 , ( 1 ,- 1 ): 

= ^3, (1,0) © ^3, (0,1) © ^2, (1,1) © ^2, (1,-1) 

©^ 2 , ( 2 , 1 ) ©^ 2 , ( 2 ,- 1 ) ©^ 2 , ( 1 , 2 ) ©^ 2 , ( 1 ,- 2 ): 

B 2 V 2 ~ ^ 3 , ( 1 , 0 ) © ^ 3 , ( 0 , 1 ) © ^ 3 , ( 1 , 1 ) © ^ 3 , ( 1 ,- 1 ) 

©^2, (2,1) © ^2, (2,-1) © ^2,(1,2) © ^2, (1,-2)- 
Figure 14 displays the eight first disks. Notice that, by construction, these disks 



^V2 



B 2 



B 2 V 2 ^3 B^ 

Fig. 14. Radial decompositions of discrete disks of increasing size using cascades 
of dilations with periodic lines. Each disk is symmetric, convex (see Sec. 6 ), and 
is opened for all disks of smaller size: jBi{Bj) = 0 for all i < j (see text for the 
definition of each Bi) 




form a granulometry with size parameter given by 1. In contrast, radial decom- 
positions using Bresenham lines [1] cannot be used to generate a granulometric 
function, neither the disks obtained by thresholding the Euclidean distance com- 
puted from a center pixel (the first element would be 02 and the second ^3 but 
it is not opened by the first, i.e., the absorption property is not satisfied). The 
generation of a granulometric family of discrete spheres S from cascades of pe- 
riodic lines can be achieved using a similar approach. Eor example, we have the 
following two first decompositions: 

Si = T2, (1,0,0) © ^2, (0,1,0) © -^2, (0,0,1): 

0 P2, (1,1,0) © ^2,(1, -1,0) 

©^ 2 , ( 0 , 1 , 1 ) © ^ 2 ,( 0 , 1 ,- 1 ) © ^ 2 , ( 1 , 0 , 1 ) © ^ 2 ,( 1 , 0 ,- 1 )* 



92 



Pierre Soille 



5 Opening/ Closings by Line Segments 

5.1 Basics 

Directional openings and closings are useful for a wide range of practical appli- 
cations such as fingerprint and fibre analysis, document interpretation (where 
thin lines have to be tracked), industrial inspection (where cracks, scratches, 
and other elongated defects have to be detected), and satellite imagery for the 
detection of road networks. A bank of directional openings and/or closings can 
also be used for extracting long thin objects of an image. This approach is illus- 
trated in Fig. 15 for the extraction of bus-like structures in a 1 metre resolution 
satellite images. Other applications of directional morphological filters are de- 
tailed in [25] for the processing of veins appearing on plant leaves and [35] for 
the filtering of laid lines appearing in paper watermarks. When the elongated 
structures contains gaps, better results are obtained by considering rank- max 
directional openings [31]. The recursive algorithm described in Sec. 3 is then 
replaced by the moving histogram algorithm [5,10] suited for rank filters. 

5.2 Orientation Field 

The orientation at a given pixel x in an image is defined as the orientation of the 
line segment that minimises the difference between the grey-level value in the 
original image at x and the grey-level value at the same location in the image 
filtered by the considered line segment [30,31]. Openings should be used for 
image structures that are brighter than their background (i.e., ‘positive’ image 
structures) and closings for image structures darker than their background (i.e., 
‘negative’ image structures). 

In mathematical terms, we define the positive orientation at a given image 
pixel X and for a given scale A as the orientation of the directional morphological 
opening of length A which modifies the least the original image value at posi- 
tion X. We denote the the positive orientation by Dif^ ^ the negative orientation 
Dir~ being defined by duality: 

Dirlif){x) = {6i I > 7LA.9,(/)(a;), V 6 >i 7 ^ Oj}, ( 11 ) 

Dir'^ {f){x) = {Oi I if){x) < yOi^ Oj}. (12) 

The positive (resp. negative) directional signature at a given pixel can be 
obtained by plotting the normalised opened (resp. closed) values versus the ori- 
entation of the line segment. This signature can then be used to detect crossing 
lines, flat zones, etc. We also define the following quantities for each point x of 
the input image /: 

Max+{f){x) = {7L;,,«.(/)(a;) | 7 La,«, (/)(a;) > 'rL^,g.if){x), V6<i 7^ Oj}, (13) 
Minl{f){x) = {lL^,e.{f){x) I 7L;,,«.(/)(a;) < 7L;,,«.,;,(/)(a:), V6<i 7^ 6 j}, (14) 
Gdirl{f){x) = Maxl{f){x) - Minl{f){x). (15) 



Morphological Operators with Discrete Line Segments 



93 





(a) Input satellite image (IKONOS) showing Pi- 
azza Venetia in Roma 



(b) Union of openings by line segments slightly 
shorter than buses (followed by reconstruction of 
original image) 




\ ^ 

I 




V 






\ 



I 



(c) Union of openings by line segments slightly (d) Difference between image (b) and (c) 
longer than buses (followed by reconstruction of 
original image) 



\ 




\ 




\ 



\ 



(e) Global threshold of (d) 

Fig. 15. Extraction of bus-like structures appearing in a 1 metre resolution satel- 
lite image (IKONOS) using union of openings by line segments 
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Gdif^ can be interpreted as the strength of the positive orientation: it will output 
a small value if there is no predominant orientation for a structuring element 
length of A pixels. Max^^ Min^, and Gdir^ are defined by replacing the opening 
with the closing in Eqs. 13-14. 

When comparing the values of Gdif^ and Gdir~ of a given pixel, it is possible 
to detect whether it belongs to a positive or negative image structure: positive, 
if Gdif^ > Gdir ~ , negative, otherwise. We denote by Gdir the point- wise maxi- 
mum between the images Gdif^ and Gdir ~ : Gdir = Gdir'^ V Gdir ~ . The image 
of directions Dir is then defined as follows: 



Din{f){x) = I 



Dirxif){x), if Gdirxif){x) = Gdirl{f){x), 
Dir^{f){x), otherwise. 



(16) 



A colour representation of the local orientation information is then simply 
achieved by equating the orientation information (i.e., either Dir'^ , Dir~ ^ or 
Dir) to the hue and strength of the orientation information (i.e., either Gdir~^ ^ 
Gdir~ ^ or Gdir) to the lightness component of the image, the colours being 
fully saturated in all cases. For example, the colour representation of the local 
orientation by opening is illustrated in Fig. 16 on IRS satellite image of Athens. 



6 Closing by a Half-Plane and Convex Hulls 

In [22], we have been shown that the convex hull transformation can be de- 
fined in terms of an intersection (point-wise minimum A) of half-plane closings. 
Hence, denoting by (j) the closing transformation, a closed half-plane having 
a given slope 0 = arctan(^/x), and the reflected half-plane, the convex hull 
transformation CFT of a grey scale image / is defined as follows: 

Cif(/) = A[0^.(/) A 0^,(/)]. (17) 

0 

In the discrete case and for a bounded image, only a finite number of directions 
need to be considered. More precisely [24], let us first define the convex hull 
CHq of order 0 as the intersection of the horizontal and vertical half-planes (i.e., 
slopes in the form 0/1 and 1/0). The convex hull of order 0 is nothing but the 
strong convex hull in the 4-connected graph. The convex hull CHi of order i G N 
is then defined as the intersection of all half-planes whose slopes are in the form 
oiyjx where x and y are integers in the range [—i^i] with no common divisors 
other than 1. The corresponding number of slopes is given by 4 card(Fi) — 4 (see 
also Sec. 2.3). Since the orientations considered for any order i > 0 is a subset of 
the orientations considered for the order i -h 1 , the following ordering relationship 
is satisfied: CHi^i < CHi. 

For a n X n image, convergence is reached at the latest for the order n — 1 
(upper bound). The actual order number depends on the shape of the image 
objects. In practice, the number of half-plane closings corresponding to the upper 
bound may be too large for applications where speed is an issue. In this case. 
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Fig. 16 . Image of the local orientation enhancing the road network of a city. 
Top: input IRS image of Athens. Bottom: colour representation of Dir~ . The 
structuring element used in this experiment is a line segment of 11 pixels using 
the moving histogram TI implementation and a rank of 4 for the corresponding 
rank-max openings 
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a trade-ofF between accuracy and computation time must be considered. For 
example, a convex approximation of the discrete convex hull of order n > 1 is 
achieved by performing half-plane closings whose slopes are in the form ±z/n 
and ±n/i, where i G {0, . . . , n} (there are only 4n — 4 such slopes). We denote 
by CHn the corresponding approximation. Notice that the following ordering 
relation holds: CHn < CHn- In [24], we show that if CH^ outputs the exact 
discrete convex hull (i.e., CH = CH^), then CHq>w \ CH is at most one-pixel 
thick. 

Finally, note that the translation-invariant implementation of half-plane clos- 
ings can be achieved by processing the pixels in the order they are reached when 
progressively translating each Euclidean half-plane so as to sweep the whole 
image definition domain. By doing so, the pixels reached at any given step cor- 
respond to those falling along the periodic line having the same slope as the 
half-plane. 

7 Conclusion 

The study of discrete line segments for morphological operators leads to a wide 
variety of developments of interest to both discrete geometry and image analysis. 
The resulting structuring elements are useful not only for filtering thin image 
structures but also for generating sound approximations of discrete disks and 
grey scale convex hulls. 
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Abstract. This paper is a continuation of our works [12,13,15,16,17,18] 
[21,22] in which we study the properties of a new framework for dis- 
cretization of closed sets based on Hausdorff metric. Let F be a non- 
empty closed subset of S C is a Hausdorff discretization of F if 
it minimizes the Hausdorff distance to F. 

We study the properties of Hausdorff discretizations of algebraic sets. 
Actually we give some decidable and undecidable properties concerning 
Hausdorff discretizations of algebraic sets and we prove that some Haus- 
dorff discretizations of algebraic sets are diophantine sets. We refine the 
last results for algebraic curves and more precisely for straight lines. 

Keywords: Algebraic set, diophantine set, Hausdorff discretization, ho- 
mogeneous metric. 



1 Introduction 

We have introduced a new framework for discretization of closed sets based on 
the Hausdorff metric : Hausdorff discretization. The basic idea is to select as 
possible discretizations of a Euclidean set F all discrete sets S such that the 
Hausdorff distance between F and S is minimal. This leads to several possible 
choices for such a discretization. 

Our framework is as follows: given a metric d on let F be a non-empty 
closed subset of then S C is a Hausdorff discretization of F if it mini- 
mizes the Hausdorff distance Hd{S,F) to F. Some geometrical and topological 
properties of Hausdorff discretization and its comparison with other discretiza- 
tion schemes are studied in the serie of papers [12,13,15,16,17,18,21,22]. 

In this paper, we study the Hausdorff discretizations of particular sets : real alge- 
braic sets. After recaling the fundametal properties of Hausdorff discretizations 
and the refinement of their properties for the class of homogeneous metrics, 
we give some decidable and undecidable properties concerning Hausdorff dis- 
cretizations of algebraic sets and we prove that some Hausdorff discretizations 
of algebraic sets are diophantine sets. Some of these results are proven by using 
the Tarski’s elimination algorithm concerning the decidability of first order logic 
for elementary algebra [9,14,19,20]. The time complexity of Tarski’s elimination 
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algorithm, is in the general case, very high [5]. We refine also the last results 
for algebraic curves and more precisely for straight lines. Actually we prove that 
the Hausdorff discretizations of straight lines can be discribed by using two pe- 
riodic diophantine sets. This last result generalize the results of [1] concerning 
the supercover discretization of straight lines. 

In this paper, for simplifying the notation in the notions which depend on a 
metric d, we do not refer explicitely to d, except when there is an ambiguity. 



This paper is divided into eight sections. In the second section we briefly 
recall classical notions of metric space and Hausdorff space. In the third section, 
we introduce the Hausdorff discretizations. Section 4 deals with homogeneous 
metrics, while Section 5 introduc real algebraic sets, real semi- algebraic sets and 
diophantine sets. In the sixth section we characterize some Hausdorff discretiza- 
tions of real algebraic sets. In the seventh section, we refine the last results for 
algebraic curves and more precisely for straight lines. The last section is a con- 
clusion. 



The proofs are not given here, the proofs the the results of sections 2, 3 and 4 
can be found in [12,15]. 

2 Some Metric Notions and Hausdorff Metric 

We assume that the reader is familiar with classical notions of topological space, 
metric space and normed space, see for example [3,6,7]. We introduce here our 
notations, most of them are recalled in [12,15,21,22]. 

Definition 1. Let (L,d) be a metrie spaee, and let p ^ 8 and r G , 

B^{p) = {x e 8 \ d{x,p) < r}. 

Bf{p) is ealled the ball of center p and of radius r. 

Let E Q 8. int{E) = {x ^ E \ 3r > 0, Bf{x) C E}, int{E) is ealled the interior 
of E (i.e. int{E) is the largest open set eontained in E ). 



In all the following, all topological notions in a metric space {8, d) are considered 
relatively to the topology induced by d. All metrics used in this paper are induced 
by norm. So if A' is a norm over 8^ then the function d^ such that: Vx, ^ G 
8^ dj^ix, y) = N{x — y) is called the metric over 8 induced by N . 

Example: 

Let X = (xi, X 2 , ..., Xn) G IR^ . Then Vp > 1, \x\p = ^|xi|^ + ... + |xn|^ and 
l^loo = max\<i<n\xi\ are a norms over 8 = IR^ . The metrics induced by these 
norms are denoted dp and doo respectively. 
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2.1 Hausdorff Metric 

The definitions and results presented in this subsection can be found in [3,6]. 

Definition 2. Let {S, d) he a metrie spaee; H{8) is the set of non-empty eompaet 
subsets of 8, T[£) the set of elosed subsets of 8, and T\8) the set of non-empty 
elosed subsets of 8(i.e. T' {8) = T{8^ \ {0}y)- 

On 1~L[8\ we will define a metric such that if (8^d) is a complete metric 
space then {l~l{8), Hd) is a complete metric space. 

Definition 3. Let (8,d) be a metrie spaee and let A,B ^ H{8). We define the 
oriented Hausdorff nnetric from A to B by hd{A^B) = maXaeA{d{a, B)) where 
d{a,B) = mini)^B{d{a^b)). 

Definition 4. Let {8, d) be a metrie spaee. The Hausdorff distance between two 
eompaet sets A, B e H{8) is defined by Hd{A,B) = max{hd{A,B),hd{B, A)). 

Remark: 

Let T' {8) be the set of non empty closed set of 8. Then, the functions hd and Hd 
can be extended in natural way as function from T\8) x Tf8) to U {+oo} 
(note: in Definition 3, the “max” (resp. “min”) becomes a “sup” (resp. “inf” ) ) 



Hd is a “generalized metric” on T' {8) in the sense that it satisfies the axioms of 
metric, but can take infinite values. 

3 Hausdorff Discretization 

In this section, we recal our framework of discretization based on Hausdorff 
metric. Our results are proved in [15]. In the rest of this paper we assume that 
we have as metric space d), where is a metric induced by a norm on 
and as a discrete space Vp = for p > 0. For a such distance, M ^ Vp 

implies that M G 

In all the following, we assume that p > 0. 

Definition 5. Let d be a metrie on and p > 0. The covering radius of the 
metrie d in Vp is 

rdp) = supxeK"{d{x,Vp))- 

Notation: 

Let (i be a metric on IR^ and r > 0; 

yp e P'ilRn, p) = {peVp\ Bt{p) nP^^- 

Let T be a non-empty closed subset of IR^] M C is a Hausdorff discretiza- 
tion of F in Vp if it minimizes the Hausdorff distance to F. In this section, we 
study the properties of Hausdorff discretizations. In [12,13,21,22], we have stud- 
ied the Hausdorff discretizations when T is a compact set. 
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Definition 6. Let F G 

• A set M <ZVp is a HausdorfF discretization of F in Vp if 
Hd{F,M) = inf{{Hd{F,S) \ S C Vp}). 

. Mh{F,p) = {M ^Vp I Hd{F,M) = znfscv,{Hd{F,S))} is the set of 
H aus dor jf discretizations of F . 

• Ah{F^p) = {{JMeMniF p) called the nnaximal HausdorfF discretization 

of F in Vp (i.e. The justification of this definition is given in the following 
Theorem ). 

• The value rniF^p) = supxeF{d{x,Vp)) is called the HausdorfF radius of F in 
Vp for the metric d. 

We characterize now the Hausdorff discretizations. 

Theorem 1. Let F G F\1R^); then: 

• AAh{F^p) is nonvoid and for M G M.h{F^p), Hd{F^M) = rniF^p); 

• for a family {Mi)i^i of members of Mh{F^ p), G Mh{F^p) and so 

^h{F,p) g Mh{F,p); 

• if {Mn)neiN is a decreasing sequence in M.h{F^p) (relatively to the set in- 
clusion ) then rinew 9 and flncw C Mh{F, p); 

• Ah{F, p) = {pFVp\ d{p, F) < vh{F, p)}; 

• AAh{F^p) is the set of all M F Vp such that M C Ah{F^p) and F is 
included in the union of balls of radius rniF, p) centered about points of M ; 

• rniF.p) < rc{p); 

. Ziff(F,p) = ZiG(f,,)(Cp). 

Definition 7. Let p G Vp, we define the the cell of center p as 
C{p,p) = {xeM^ \ doo{x,p) < |} 

Note that cells are closed and overlap only at their boundaries. 

In Figure 1 we illustrate the construction of Hausdorff discretizations for a closed 
set F: computing the Hausdorff radius (maximal distance from points of F to the 
discrete space), one takes for Ah{F, p) all discrete points p such that the ball of 
center p and Hausdorff radius intersects F; any subset M of Ah{F, p) such that 
the corresponding balls for p G M cover F, will be a Hausdorff discretization. 

Definition 8. Let d be a metric on IR^ and F G the skeleton of 

Ah{F,p) is the set 

sk{F,p)= Pi m. 

MeMniF^p) 

Note that Sk{F, p) is not generally a Hausdorff discretization of F. 
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Fig. 1. Left: The set T = A U 5 U C overlayed with discrete points p,q,r,s 
and their square cells Right: For d = d ,2 (the 

Euclidean distance), the maximal Hausdorff discretization of F is {p, 5 }; 

indeed, we show the circles of radius th{F^ 1) centered about these points. The 
unique other Hausdorff discretizing set of F is {p, < 7 , s} 

4 Homogeneous Metric and Hausdorff Discretization 

We present some properties of a homogeneous metric and we refine the char- 
acterization of Hausdorff discretizations for such metrics. Actually, we study 
the relationship between Hausdorff discretizations and covering discretizations. 
Again, results of this section are proved in [15]. 

Definition 9. A norm N on is homogeneous if for every (xi,...,Xn) G 
IR^ , every (^i, ..., ^ and for every permutation cr 0 / {!,..., n}, we 

have A/'(eiXcr(i) , ..., £n^cr(n)) = ...,Xn). A metrie indueed by a homogeneous 

norm is ealled a homogeneous metrie. 

Theorem 2. Let d he a homogeneous metrie indueed by a norm N, then: 

• rc(l) = ^^'(l, ..., 1) and 

• C C for all 
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Examples: 

1 

In Vp > 1, (ip is a homogeneous metric and thus, rc(l) = (^), and doo is 
also a homogeneous metric and thus, rc(l) = 

Definition 10. Let E C IR^ , a subset S C Dp is ealled a covering discretization 
of E in Vp, if\/p e S, E nC{p, p) / 0 and E C C(p, p) 

An example of covering discretization is the supereover diseretization Asc 
which associates to every X G V{1R^) the set of all p e Vp such that C{p,p) 
intersects X (see Figure 2): 

VX c iR-, p) = {pe Vp I C(p, p)nX^ 0}. 




Fig. 2. (a) The cell C{o,l). (b) A Euclidean set X overlayed with the discrete 
points p and their cells C(p, 1). (c) Asc{^^ 1) 



Theorem 3. Let d he an homogeneous metrie and let F G T\1R^). If S is a 
eovering diseretization of F in Vp, then S G Mh{F,p). 

Definition 11. A metrie d on is ealled strictly homogeneous if d is homo- 
geneous and n 1) = {(5,5)}. 

In other words, the balls of covering radius centered about diagonally adjacent 
discrete points intersect only at their corners. For example dp is strictly homo- 
geneous for all p > 1 and for p = oo. 

5 Algebraic Sets and Diophantine Sets 

In this section, we give the definitions of real algebraic sets, real semi- algebraic 
sets and diophantine sets. We recall also the decidability of first order logic for 
elementary algebra corresponding to Tarski’s elimination algorithm. 

Definition 12. A subset A of IR^ is an algebraic set, if there exists a polynomial 
P G lR[Xi, Xn] sueh that A is the set of zeros of P in IR^ (i.e. A = {x ^ 

I p(x) = 0} ;. 

If the eoefficients of P are integers (i.e. P G Z[Xi, ..., Xn]) then A is ealled a 

^-algebraic set. 
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Definition 13. A subset S of IR^ is a semi-algebraic set^ if it can be written as 
a finite union of set of the form: 



where Pi, P/, Qi , Qm •••, Xn]- If the coefficients of polynomials 

involved in a definition of the semi- algebraic set S are integers (i.e. Pi^Qi G 
Z[Xi, ...,Xn]) S is called a ^-semi-algebraic set. 

Actually every subset of IR^ defined by a boolean combination (obtained by 
disjunction, conjunction and negation) of atomic formulas of the form P{x) > 0, 
P{x) < 0 or P{x) = 0, where P G lR[Xi, is a semi-algebraic set of 

Definition 14. A subset D of is a diophantine set, if there exists a poly- 
nomial P G Z[Xi^ Xn] such that D is the set of zeros of P in (i.e. 
D = {xeZ^ I P(x) = 0} 

Remarks: 

• A finite intersection of algebraic (resp. diophantine ) sets is an algebraic (resp. 
diophantine) set. Actually if Ai, ..., are in IR (resp. in Z) then 

Ai = 0 and A 2 = 0 and ... and A/^ = 0 if and only if A^ -h A 2 + ... + A^ =0. 

• A finite union of algebraic (resp. diophantine ) sets is an algebraic (resp. dio- 
phantine) set. Actually if Ai, ..., A/c are in IR (resp. in then 

Ai = 0 or A 2 = 0 or ... or A/c = 0 if and only if A 1 .A 2 ...A/C = 0. 

In the following theorem, we present the result corresonding to Tarski’s elimi- 
nation algorithm which proves the decidability of first order logic corresponding 
to elementary algebra. 

Theorem 4. [ 2 , 14 , 19 , 20 ] 

Let P\{Xi, ...,Xn,Y), ...,Pk{X\, ...,Xn,Y) be a sequence of polynomials in n -\- 
1 variables with coefficients in Z. Then there exists an effectively computable 
boolean combination B{Xi , ..., Xn) of polynomial equations and inequalities in the 
variables Xi, ...,Xn with coefficients in 'Z , such that for every (xi, ...,Xn) G , 
the system of polynomial equations and inequalities 



where )-^G {=, >, >} for i = 1 , ..., k, 

has a solution y in M if and only if 8{xi, ...,Xn) holds true in M. 

Corollary 1. • The boolean combination B in the last theorem define a semi- 

algebraic set (i.e. {x G IR^ \ B{x) holds true in M} is a semi- algebraic set 



{x G I Pi(x) = ... = Pi{x) = 0, Qi{x) > 0, ..., Qm{x) > 0}, 




ofM^ ). 
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• Let F be a field whieh is a finite extention of Q and let Y") 

be a boolean eombination of polynomial equations and inequalities in the 
variables Xi, ...,Xn,Y with eoeffieients in F; then there exists an effeetively 
eomputable boolean eombination >B"(Xi, X^) of polynomial equations and 
inequalities in the variables Xi,...,Xn with eoeffieients in F, sueh that for 
every C IR^ , there exists y e IR sueh that B' ...^Xn^y) holds 

true in IR if and only if B"{x\^ holds true in M. B" is obtained from 

B' by using Tarski’s elimination algorithm eorresponding to transformation 
rules permitting the elimination of the variable Y. 

6 Hausdorff Discretizations of Algebraic Sets 

In this section we study the Hausdorff discretizations of real algebraic sets. We 
prove that some Hausdorff discretizations of real algebraic sets are diophantine 
sets and we give some decidable and undecidable properties concerning Hausdorff 
discretizations of real ^-algebraic sets. 

Definition 15. A metrie d is ealled algebraic (resp. ^-algebraic^ if the unit ball 
of d is a semi-algebraie (resp. Z-semi-algebraie ) set. 

For example dp is ^-algebraic for all p G (Q such that p > 1 and for p = oo. 

The following property is a consequence of the undecidability of diophantine 
equations ( Hilbert’s Tenth Problem ) [10]. 

Property 1. Let n > 3. The following problem is undecidable: 

Is rniA, p) = 'Tc(p) for a given ^-algebraic set A of IR^ ^ a ^-algebraic metric d 
on WF and p G 

The following property is obtained by successive applications of Tarski’s elim- 
ination algorithm, to a polynomially constrained formula Fn whith n unknows 
corresponding to the problem, until obtaining a formula Fq without unknowns. 

Property 2. The following problem is decidable: 

Is p G Af{A^ p) for a given p G pZ^ ^ a ^-algebraic set A of IR^ ., a ^-algebraic 
metric d on IR^ and r, p G 

Theorem 5. Let A be a Z-algebraie set of and d be a Z-algebraie metrie 
on IR^ , then 

• ^Af{A^p) is a diophantine set for all r^p ^ . 

• ^Sk(Ajp) is a diophantine set for all p G . 

Corollary 2. Let A be a Z-algebraie set of and d be Z-algebraie metrie 
on , then 

• The maximal Hausdorff discretization Ah{AA) of A is a diophantine set 
and ^Ah{A,p) is a diophantine set for all p G . 

• The supereover discretization Asc{AA) of A is a diophantine set 
and ^Asc{Aj p) is a diophantine set for all p G . 
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7 Hausdorff Discretization of Algebraic Curves 

In this section we study the Hausdorff discretizations of ^-algebraic curves 
(i.e. a -algebraic set of M^). We refine this study for straight lines by using 
Fourier’s elimination algorithm because some of the constraits corresponding to 
this problem are linear. So Fourier’s elimination algorithm is a special case of 
Tarski’s elimination algorithm. Our result concerning Hausdorff discretizations of 
straight lines generalize the results [1] concerning the supercover discretization of 
straight lines. Several properties of Hausdorff discretizations of straight lines are 
decidable because these discretizations are defined by linear constraints, which 
is not the case for Hausdorff discretizations of non linear algebraic sets of 
for n > 3 because, it corresponds to diophantine equations on n unknows [10]. 
Notice that the decidability of a diophantine equation in two unknowns is still 
an open problem. 

Theorem 6 . Let p G IR^ and let C be a -algebraic curve such that the radius 
of curvature of C is strictly greater than p and let d be an homogeneous and 
Z -algebraic metric, then 

• If '^h{C,p) < rc{p), then there exists two diophantine sets D\ and D2 such 
that 



M G p) if and only if M = p{Di U S) with S C D2. 

• If '^h{C,p) = rc{p) then there exists tree diophantine sets Dq, D\ and D2 
such that 

M G A4h{C, p) if and only if M = p{Dq U U S2) with Si C Di, S2 ^ D2 
and ifp G {Di\Si), then there exists S ^V4,{p),S C S2, ^ ^ 

allqeS andCnC{p,p) C [jqeS ^Lp) (q), where V/i{p) is the ^-neighbourhood 
of p in pZ^ . 

Corollary 3 . Let p G IR^ and let C be a Z -algebraic curve such that the ra- 
dius of curvature of C is a strictry greater than p and assume that d is strictly 
homogeneous and Z -algebraic metric, then there exists two diophantine sets D\ 
and D2 such that, 

M G Mh{C, p) if and only if M = p{Di U S) with S C D2. 



7.1 Hausdorff Discretizations of Straight Lines 

In this subsection we study the Hausdorff discretizations of straight lines for 
homogeneous metrics. Some of the results of this subsection are proved by us- 
ing Fourier’s elimination algorithm [4,8,11] which transforms linear constraints. 
Fourier’s elimination algorithmis is a special case of Tarski’s elimination algo- 
rithm. The time complexity of Tarski’s elimination algorithm, is in the general 
case, very high [5]. 
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Before giving the results corresponding to Hausdorlf discretizations of straight 
lines, we treat two special cases of horizontal and vertical straight lines. 

Let L be a straight line such that L = {(x, G \ x + p{m + |) =0} (resp. 
L = {{x,y) G I y + p{m + |) = 0} ^ for m G then rniL.p) = rdp) 
and VM G M-Hdi^^p) Vi G if (pm^i) ^ M then (p(m + l),i) G M ( resp. 

Vi G if (i, pm) ^ M then (i, p(m + 1)) G M ). 

In all the following, we assume that the considered straight lines are not of the 
last forms. 



Definition 16. S C is a periodic set if there exists p G and p ^ (0,0) 

sueh that q p G S for all q E S. 

Theorem 7. Let C = {(x^y) G \ ax by c = 0} sueh that a^b,c G p'Z 

1 - If d is an homogeneous metrie (the unit ball of d is not neeessarily a semi- 
algebraie set) and rniL, p) < r^p), then there exist two periodie diophantine 
sets Di and D 2 sueh that 

M G (£, p) if and only if M = p{Di U S) with S C D 2 . 

2 - If d is a strietly homogeneous metrie (the unit ball of d is not neeessarily a 
semi-algebraie set), then there exist two periodie diophantine sets D\ and D 2 
sueh that 

M G Al/f (£, p) if and only if M = p{Di U S) with S C D 2 . 

Sketch of proof: 

This theorem is a consequence of the following two facts: 

C = {(x, y) G I ax + + c = 0}, 

• Let r > 0 and Dr{C,p) = {(m,n) G pZ^ \ C ^ int{C{(m,n),r)) ^ 0}. 

So, Dr{C, p) is the set of (m, n) G p!Z^ such that there exists t,t' G IR and 

' a{m + t) + a{n + t') + c = 0 
t > 

< -^>-i 




So, Dr{C, p) = {(m, n) G p!Z^ \ — §(|n| + |^|) < am-\-bn-\-c < |(|a| + |6|)}. Thus, 
^Dr{C, p) is a diophantine set and if (m, n) G Dr{C, p) then (m, n) + z{b, —a) G 
Dr{C, p) for all z ^ Z. 

• Let r > 0 and (m,n) G pZ^ such that Bf{m,n) ^ C ^ tl), then Bf(fm,n) + 
z(b, — a))n£ 7^ 0 for all z ^ Z. Moreover, if for some r', C^int{C{{m, n),r')) = 0, 
then C n int{C{{m, n) + z{b, —a),r')) = 0. 



In Figure 3, we illustrate the sets Di and D 2 of the last theorem. 
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Fig. 3. The hatched squares correspond to points of D\ and the grey squares 
correspond to points of D2 



8 Conclusion 

Throughout several papers we have introduced a new framework for the dis- 
cretization of a non-empty closed set, based on the Hausdorff distance. In this 
paper we have studed Hausdorff discretizations of real algebraic sets. Actually 
we have given some decidable and undecidable properties concerning Hausdorff 
discretizations of algebraic sets and we have proved that some Hausdorff dis- 
cretizations of algebraic sets are diophantine sets. We have refined the last re- 
sults for algebraic curves and more precisely for straight lines. Some of these 
results are proven by using the Tarski’s elimination algorithm concerning the 
decidability of first order logic for elementary algebra. 

We intend to do further investigations on: 

• effective computation of Hausdorff discretizations of non linear algebraic sets; 

• topological properties of Hausdorff discretizations of algebraic sets; 

• complexity of representation of some Hausdorff discretizations of algebraic 
sets. 
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Abstract. We study the problem of reconstructing finite subsets of the 
integer lattice from their approximate X-rays in a finite number of 
prescribed lattice directions. We provide a polynomial-time algorithm 
for reconstructing Q-convex sets from their “approximate” X-rays. A Q- 
convex set is a special subset of Z^ having some convexity properties. 
This algorithm can be used for reconstructing convex subsets of Z^ from 
their exact X-rays in some sets of four prescribed lattice directions, or in 
any set of seven prescribed mutually nonparallel lattice directions. 



1 Introduction 

The problem of reconstructing two-dimensional lattice sets from their X-rays 
has been studied in discrete mathematics and applied in several areas. In this 
context, a two-dimensional lattice set is a finite subsets of the integer lattice 
Z^, and an X-ray of a set in a direction is a function giving the number of 
its points on each line parallel to u. This problem is the basic reconstruction 
problem in discrete tomography [10] and it has various interesting applications 
in image processing [15], statistical data security [12], biplane angiography [14] 
and reconstructing crystalline structures from X-rays taken by an electron mi- 
croscope [13]. The computational complexity of this reconstruction problem is 
studied in [9]. It is shown that the question of reconstructing two-dimensional 
lattice sets from their X-rays in a set of m > 3 pairwise nonparallel directions is 
NP-hard. In most practical applications we have some a priori properties about 
the sets to be reconstructed. The algorithms can take advantage of this informa- 
tion to reconstruct the set. Mathematically, these properties can be described 
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in terms of a subclass of subsets of to which the solution must belong. For 
instance, there are polynomial-time algorithms to reconstruct hv-coiwex poly- 
ominoes (i.e., two-dimensional lattice subsets which are 4-connected and convex 
in the horizontal and vertical directions) from their X-rays in horizontal and 
vertical directions [2,5]. The class of convex lattice subsets (i.e., finite subsets F 
with T = Z^ n convT) is another well-known and studied class in discrete to- 
mography. Gardner and Gritzmann [ 8 ] proved that the X-rays in four suitable or 
any seven prescribed mutually nonparallel lattice directions uniquely determine 
all the convex lattice subsets. The complexity of the reconstruction problem on 
this class is an open problem raised by Gritzmann during the workshop: Discrete 
Tomography: Algorithms and Complexity (1997). 

In this paper, we study the problem of reconstructing Q-convex sets from 
their X-rays in a finite number of prescribed directions. The Q-convexity is a 
weak convexity property linked to a finite number of directions. The class of Q- 
convex sets contains all the convex lattice subsets. We provide a polynomial-time 
algorithm for reconstructing Q-convex sets from their “approximate” X-rays. 
This means that, the algorithm decides whether or not there is a Q-convex set 
whose X-rays all lie within prescribed bounds. If there is at least one Q-convex set 
having X-rays lying within these bounds, the algorithm reconstructs one of them 
in polynomial time. Boufkhad et al. [3] studied the problem of reconstructing 
h, 'L’-convex polyominoes from their “approximate” horizontal and vertical X- 
rays. We show that our algorithm solves this problem in polynomial time. The 
algorithm can be used for reconstructing Q-convex sets from their “exact” X- 
rays. A greedy algorithm for solving this problem has been defined in [4], and our 
new approach is faster than this algorithm for a number of directions equal to 
two and three. We point out that recently, it is proved in [ 6 ] that the uniqueness 
Gardner’s and Gritzmann’s results can be extended to the class of Q-convex 
sets. From this uniqueness result for Q-convex sets it follows that our algorithm 
can be used for reconstructing convex lattice subsets from their exact X-rays in 
some sets of four prescribed lattice directions, or in any set of seven prescribed 
mutually nonparallel lattice directions. This means that, our algorithm and the 
one defined in [4] solve Gritzmann’s problem for these special sets of directions. 

2 A Reconstruction Algorithm for Two X-Rays 

In this section, we are going to define an algorithm for reconstructing Q-convex 
sets from their approximate X-rays in two directions. The basic idea of the 
algorithm is to determine a polynomial transformation of our problem to the 
2- Satisfiability problem which can be solved in linear time [1,2,5]. 



2.1 Definitions and Preliminaries 

Let p and q be two independent linear forms on We can assume that: 
p(x, y) = ax F by and q{x, y) = ex F dy with a, h^c^d G Z, ad — he 7 ^ 0, 
gcd{a, 6) = I, gcd{c, d) = 1. Assuming that M = (xm, Vm)^ we denote p{xm, yu) 
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by p{M). The X-ray of a lattice set F along direction p{M) = const is the func- 
tion XpF : Z ^ No defined by: 

XpF{i) = card({A/' G F \ p{N) = i}). 

This, in turn, means that an X-ray of a lattice set T in a direction p is a function 
giving the number of points of F on each line parallel to this direction (see 
Fig. 2(a)). We define four zones around a point M of 7? (see Fig. 1) as follows: 

Zo(M) = {N \ p{N) < p{M) and q{N) < q{M)}, 

Zi(M) = {N I p{N) > p{M) and q{N) < q{M)}, 

Z 2 {M) = {N ez^ I p{N) > p{M) and q{N) > q{M)}, 

Zs{M) = {N eZ^ I p{N) < p{M) and q[N) > q{M)}. 

We can now introduce the definition of Q-convex set around two directions. 




Definition 1. A lattice set F is Q-convex around p and q if and only if for each 
M ^ F there exists k such that Zk{M) H F = 0 for k G {0, 1, 2, 3}. 

By the definition, if there is at least one point of F in every zone Zo(M), Z\{M)^ 
Z 2 {M) and Z^{M), the point M has to belong to F. Fig. 2(a) shows some 
examples of Q-convex sets. We point out that, from the definition it follows that 
a Q-convex set around p and is a discrete set which is convex along p and q. A 
discrete set F is convex along p if for each pair of points (M, N) of F such that 
p(M) = p{N), the discrete segment [MN] H Z^ is contained in F. Let us now 
take the following problem into consideration. 

Problem 1. Approximate Consistency with two directions 
Instance: four vectors P = (pi, . . . ,pn), P' = (jp'i, • • • Q = • • • , him), 

Q' — Wii - — iQ-'m) whose elements are no- negative integer numbers and pi, pn^ 
Pii p'n^ positivc integer numbers. 

Question: is there a Q-convex set F around p and q such that pi < XpF{i) < p[ 
for z = 1, . . . , n and qj < XqF{j) < q'- for j = 1, . . . , m ? 
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Fig. 2. a) A Q-convex set around p{x^ y) — x and q{x^ y) — y.h) A set which is not Q- 
convex set around y) — x and q{x^ y) — y. c) A Q-convex set around p{x^ y) — x-\-y 
and q{x^ y) — x — 2y 



The problem is to decide whether or not there is a Q-convex set around p and q 
whose X-rays in these two directions lie within prescribed bounds. If P = P' 
and Q = Q' , we have the Exact Consistency problem with two directions. 

In the following subsections, we determine a polynomial transformation of Prob- 
lem 1 to the 2- Satisfiability problem (2SAT). 



2.2 Q-Convexity 

The intersection of the p-line p{M) = i with the q-\me q{M) = j is not always 
in Z^. It is easy to prove that the point M intersection of p{M) = i with 
q{M) = j belongs to I? if and only if j = ni (mod S), where: S = \ad — hc\^ 
n = {cu + dv)sign{ad — be) (mod (i) and au bv = 1 (see Fig. 3(a) and [7]). 
Without any loss of generality, we can assume that a Q-convex set F around p 
and q whose X-rays are such that: pi < XpF{i) < p'- and qj < XqF{j) < q'- for 
all z, j, is contained in the lattice parallelogram: 

A = {N G Z^ I 1 < p{N) < n and 1 < q{N) < m}. 

We denote the point M G A intersection of p{M) = i with q{M) = j by 
M = (i,j) (see Fig. 2(c)). Let K = {0, 1,2,3}. We associate four boolean vari- 
ables Vk{M), with /c G AT, at every point M ^ A (i.e., one variable for each 
zone Zk{M)). The idea of the algorithm is to build a 2SAT formula APPROX 
on the variables {Vk{M))keK,MeA so that there is a solution F of Problem 1 if 
and only if APPROX is satisfiable. If there is an evaluation V of the variables 
{Vk{M))keK,MeA satisfying APPROX, the corresponding lattice set F solving 
Problem 1 is defined by function ^ as follows: 

F = ^(V) iff P = {M G zA I -W(M) is true, V/c G AT}, 

where is true if and only if Vk{M) is false. Conversely, if P is a sub- 

set of A solving Problem 1, the corresponding evaluation V of the variables 
(Vk{M))keK,MeA satisfying APPROX is defined by function P as follows: 



V = P(P) iff Vk{M) = ^^Zk{M) n P = 0”, with keK, M e A. 
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We assume that all literals outside A are true. The boolean formula APPROX is 
made up of three sets of clauses expressing: the Q-convexity (QCONV), a lower 
bound (LB) and an upper bound (UB) on the X-rays. The Q-convexity can be 
expressed with the boolean variables by the formulas: 



(VoUj) ^ Voii - S,j)) A (VoUj) ^ VoUj - 6))' 

^0= A ( ^Ao<u<s (Vo{i,j) ^Vo{i-u,j -v)^ 



M={i,j)eA 



0<v<6 
v=K,u(mod5) 



(Vi{i,j) ^ V(i + S,j)) A (V(f, j) ^ Vi{i,j - S))' 

A ( ^Ao<u<s (vi{i,j) ^Vi{i + u,j -v)) 

, . L \ . \ d<rv<rA V / 



M={i,j)eA 

22= A 

M={i,j)eA 



0<v<6 
v=K,(mod5) 



(V2{i,j) ^V2{i + A,i)) A {V2{i,j) => V2{i,j + A)) ^ 
AAo<«<5 (V2{iJ) ^V2{i + u,j + v)) 

0<v<5 ^ / 



\ tJ— ^vu(mod6) 

/ (VdiJ) => Vs{i - 6,j)) A {Vs{i,j) => Vs{i,j + 6)) 

^3= A I AAo<«<5 ^V3{i-u,j + v)^ 



M={i,j)eA 



0<v<5 
v=K,u(mod5) 



The points in the grey zone around M = (i,j) in Fig. 3(a) are the points of 
A used in Zo,Zi,Z 2 and Zs (i.e., the points {i ± S^j ± S)^ {i ± u^j ± v) with 
0 < u < 0 < u < (^). Let us denote Zq A Zi A Z 2 A Zs by QCONV. 




Fig. 3. The points around M used in Z^^Zx^Z^ and > 2^3 with p{x^y) = 2x y and 
q{x,y) = x-2y 



Lemma 1. Let V be an evaluation of the variables {Vk{M))keK,MeA satisfying 
QCONV. If M e A and W(M) is true, then W(V) is true for all N e Zk{M). 

Proof. Assume that k = 0 and let M = {iM^jM)- We have: Vo{i,j) ^ Vo{i — S,j) 
and Vo{i,j) ^ Vo{i,j — S) are satisfied, for all i,j. Therefore, by induction we 
can prove that VqQm ~ kd,jM ~ Id) is true, for all k,l G N. Let V be a point 
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of Zq{M). Let N' be the point of A such that: 
p{M)-p{N') _ p{M)-p{N) q{M 



q{M) - q{N') 
6 



q{M) - q{N) 
5 



6 6 



By the previous statement we have Vq{N') is true (see Fig. 3(b)) and, since the 
formula Vq{N') ^ Vq{N) is in Zq, we finally obtain Vq{N) is true. We proceed 
in the same way for k equal to 1,2 and 3. 

Thus, we can characterize the Q-convexity by means of the formula QCONV. 

Lemma 2. 

— For any set F G A the evaluation V = it^(F) of the boolean variables 
{Vk{M))keK,MeA satisfies the formula QCONV. 

— If an evaluation V of the boolean variables {Vk{M))keK,MeA satisfies the 
formula QCONV, then F = is Q-eonvex around p and q. 



— Assume that V = ^{F) does not satisfy QCONV. By this assumption, there 
exists k such that at least a clause of Zk is not satisfied. Then Vk{M) and 
~^Vk{N) are true, where N G Zk{M). As a consequence, Zk{M) n F = 
0, Zk{N) n F jtz 0 and Zk{N) C Zk{M). We got a contradiction and so 
V = f\f) satisfies QCONV. 

— The second statement is just a consequence of Lemma 1. If M ^ T, by the 
definition of ^ there exists k such that Vfc(M) is true. Therefore, by Lemma 1, 
we have Vk{N) for each N G Zk{M). Consequently, Zk{M) H F = 0. 

2.3 A Lower Bound 

Now we want to express that X-ray values of a lattice set in the direction p 
are greater than some prescribed integers. Let us take the line p{M) = i into 
consideration. Let qmiui = mm{j\{i, j) G A} and qmaxi = max{j|(z, j) G A}. 
Notice that, if S 1 these numbers are not always equal to 1 and m. We define 
the formula LB(p, i,l) in the following way: 



Proof 



LB{p,z,l) = TRUE 



i^l = 0 




Ai<j<m-51 {Li{j) A L2{j) A Ls{j) A T4(j)) 




LB{p,i,l) 



otherwise 
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where 

Li{j) = Vo{z,j)^^V2{z,j^Sl) 

L2{j) = Vo{z,j)^^Vs{z,j^61) 

L[ = -^V2{i,qmirii + 5{l - 1)) 

L '2 = qmirii + 6{l - 1)) 

L 3 = ~^Vo{i^ qmaxi — 8(1 — 1 )) 

L 4 = ~^V\(i^ qmaxi — 8(1 — 1 )) 

Lemma 3. 

— If a lattice set F is Q- convex around p and q and its X-ray along p is 
such that XpF(i) > I, then the evaluation V = I^(F) of the variables 
(Vk(M))keK,MeA satisfies QCONV ALB(p,i,l). 

— If an evaluation V of the variables (Vk(M))k^K,MeA satisfies QCONV A 
LB(p, z,/)^ then the X-ray of F = dh(V) along p is such that XpF(i) > 1. 

We do not show the proof of this Lemma for brevity’s sake. We define the formula 
LB(g^, j, 1) for the lines in the direction in a similar way. 



2.4 An Upper Bound 

Now we want to express that X-ray values of a lattice set in the direction q are 
smaller than some prescribed integers. For this upper bound, we need to fix two 
points A and B such that p(A) = 1 and p(B) = n. We call bases these two points 
(see Fig. 2(c)). Let us take the line q(M) = j into consideration. We introduce 
the formula: 



UB(g,j,l,4B) = IN(^) AIN(B) A A 

l<i<n-5l 
j=K,i (mod 5) 



where ; IN(M) = (M) A (M) A ^^2 (M) A (M) and 



a) If j < min{ 5 (A), 5 (B)}, 

b) lfq{A)<j<q{B) 

c) If q{B) <j< q{A) 

b) If j > max{g(^), q{B)} 



U{i) = ^Vo{i,j)^Vi{i + 5l,j) 
U{t)=^V3{i,j)^VX + Sl,j) 
U{i)=^VoUj)^V2{t + Sl,j) 
U{i) = ^V3{i,j)^V2{i + Sl,j) 
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Lemma 4. 

— If a lattice set F containing the bases A, B is Q- convex around p and q, and 
its X-ray along q is such that XqF{j) < I, then the evaluation V = I^{F) of 
the variables {Vk{M))keK,MeA satisfies QCONV A UB(g^, /, A, 5). 

— If an evaluation V of the boolean variables {Vk{M))keK,MeA satisfies the 

formula QCONV AUB(g^, j, /, B), then F = F(V) contains the bases B, 

and its X-ray along q is such that XqF{j) < 1. 

We do not show the proof of this Lemma for brevity’s sake. We define the formula 
UB(p, i,l,A,B) for the lines in the direction p in a similar way . 



2.5 The Reconstruction Algorithm 

Let (P, P', Q, Q') be an instance of Problem 1. We fix four bases A,B,C,D such 
that p{A) = 1, p{B) = n, q{C) = 1, q{D) = m and then we build the formula: 

APPROX(P, P', Q, Q', A, P, C, D) = QCONV A 
A (LB(p,i,Pi) AUB(p,i,p',C',r))) A /\ 

l<i<n 

As a consequence of Lemmas 2,3 and 4, we get: 

Theorem 1. APPROX(P, P', Q, Q' , A, P, C, D) is satisfiable if and only if there 
is a Q-convex set F around p and q containing the bases A^B^C^D and having 
X-rays along p and q such that pi < XpF{i) < p[, for i = 1, . . . , n, and qj < 
^qF{j) < <lj, for j = 1,... , m. 

Since APPROX(P, P' ^Q^Q' ^ A^B^C^D) is a boolean formula in conjunctive nor- 
mal form with at most two literals in each clause, from Theorem 1 we have 
a transformation of Problem 1 to 2SAT problem. The algorithm chooses four 
bases A, P,C, P, and builds APPROX(P, P', Q, Q', A, P, C, P). Each formula 
APPROX(P, P', Q, Q', A, P, C, P) has size 0{mn) and can be constructed in 
0{mn) time. This is a 2SAT formula and so it can be solved in 0{mn) time 
(see [1]). If the formula is satisfiable and V is the evaluation of the boolean 
variables, then F = F(V) is solution of Problem 1. On the contrary, the re- 
construction attempt fails and the algorithm chooses a different position of the 
four bases A, P,C, P, and repeats the procedure. The number of reconstruc- 
tion attempts is bounded by the number of different positions of the four bases 
A, P, C, P, and this is at most m^n^. Consequently: 

Corollary 1. Problem 1 can be solved in 0{m^n^) time. 

Remark 1. An 8-connected Au-convex set is a Q-convex set around p{x^y) = x 
and q{x,y) = y with at least one point in each row and column. This class 
of lattice sets is a well-know generalization of the class of Au-convex polyomi- 
noes [2,5] which are 4-connected and convex in horizontal and vertical directions. 
Boufkhad et al. [3] studied the “approximate consistency with two directions” 



An Algorithm for Reconstructing Special Lattice Sets 121 



problem on the class of hv-coiweyi polyominoes. In detail, given a pair of vectors 
V = (fi, . . . ^Vn) and H = {hi^ ^ hm), they want to reconstruct an to-convex 
polyomino whose X-rays along vertical and horizontal directions are such that: 
\XpF{i) — Vi\ < 1 for i = 1, . . . ,n and \XqF{j) — hj\ < 1 for j = 1, . . . ,m (i.e., 
P = V — 1, P' = V F l,Q = H — 1 and, Q' = H F 1). Our algorithm solves 
this problem in polynomial time on the classes of 8-connected hv-corwex sets 
and /i'c-convex polyominoes (with an extra condition on the boolean variables). 
We point out that the goal of Boufkhad et ah is to solve the corresponding opti- 
mization problem. They want to reconstruct hv-coiwex polyominoes from these 
“approximate” horizontal and vertical X-rays and such that the sum of the ab- 
solute differences is minimum. By means of a SAT solver, the authors defined a 
heuristic algorithm for solving this problem. We do not know at this time if our 
algorithm can be used for solving this optimization problem in polynomial time. 

Remark 2. Mir sky [11] proved that “approximate consistency with two direc- 
tions” problem on the class of all lattice sets can be solved in polynomial time. 

The Exact Reconstruction. If P = P' and Q = Problem 1 becomes the 
Exact Consistency problem with two directions on the Q-convex sets around p 
and q. Notice that, Pi ~ Sj=i ^ necessary condition. This problem has 
been studied in [4] and the authors propose a greedy algorithm whose computa- 
tional cost is F n) min{m^, n^}). We are going to show an algorithm 

which is faster than the algorithm defined in [4]. Let A, B be two bases such that 
p{A) = 1 and p{B) = n. We build the following formula: 

EXACi:{P,Q,A,B) = QCONV f\ LB(p,^,pi) /\ 

l<i<n l<j<m 



Proposition 1. EXACT(P, Q, A, P) is satisfiable if and only if there is a Q- 
convex set F around p and q containing A, B and having X-rays along p and q 
such that XpF{i) = pi, for i = 1, . . . , n, and XqF{j) = qj, for j = 1, . . . , m. 

Proof. 

— Assume that EXACT(P, Q,A,B) is satisfied by an evaluation V of 
{Vk{M))keK,MeA- By Lemmas 2,3 and 4, the set F = dhiV) satisfies the 
conditions: A^BeF^ XpF{i) > pi and XqF{j) < qj for all i^j. Then, 

mm n m 

j=l j=l i=l i=l 

and, since ^pF{i) = Pi and XqF{j) = qj for all i,j. 

— If is Q-convex around p and q and satisfies A,B G F, XpF{i) = pi, 
and XqF{j) = qj, by Lemmas 2,3 and 4, the evaluation V = 'i/{F) satisfies 
EXACT(P,Q, A-B). 
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The number of reconstruction attempts for the exact consistency problem is 
bounded by the number of different positions of the two bases A, and this is 
at most min{m^,n^}. Consequently: 

Corollary 2. Exact Consistency problem is solved in 0(mnmin{m^, n^}) time. 

3 More than Two Directions 

We now outline an algorithm for reconstructing Q-convex sets from their X-rays 
in more than two directions. Let us introduce a definition of Q-convex set around 
more than two directions. Let [/ be a set of d directions {uh = (i.e., 

pairs of coprime integers, with bh > 0). The linear form corresponding to vector 
'^h = (tt/i, bh) is Uh{x^ y) = bhX — Ohy- Given two directions Ui^ Uj G /7, we define 
four zones (M) around every M G as in the previous section. Therefore, 
there are 2d{d — 1) zones for each M G and we are going to select 2d of 
these zones. A point M of a line in direction Uh splits it into the following two 
semi-lines having origin in M: 

s+(M) = {N el?\ Uh{N) = Uh{M) and Uh-Op>Uh- oM} 
s^{M) = {N el? \ Uh{N) = Uh{M) and Uh-Ol^ <uh- Oli} 

where denotes the scalar product of two vectors and O is any origin point. 

Definition 2. An almost- semi-plane (or ASP) along U is a zone such 

that for each direction Uh of U only one of the two semi-lines s^(M),s^(M) is 
contained in Z^'^\m). 

Let 77o(M) be the ASP containing s^(M) for each h = 1,... , d. We denote 
the other almost-semi-planes encountered clockwise around M from 77o(M) by 
7Ii(M),... , 7l2d-i(M). For example, let U = ^ 3 }, with u\ = y^ U 2 = 

X, Us = xPy. The six ASP around a point M are: 7Io(M) = Z^'^^ ni{M) = 
zf’^^(M),7l2(M) = z[^^‘^\M),ns{M) = and 

IIs{M) = Z^'‘^\m) (see Fig. 4). Now we can generalize the Q-convexity to any 
set of directions: 

Definition 3. A lattice set F is strongly Q-convex around U if and only if for 
each M ^ F there exists an ASP IIk{M) around M such that F H IIk{M) = 0. 

Let us consider the approximate consistency problem on this class. 

Problem 2. Approximate Consistency with more than two directions 
Instance: 2d vectors Pi = ^ = 

(Pd,i, • • • , Pd, rid) Pd = (Pd,i^ • • • ^Pd,rid) whose elements are no-negative in- 
teger numbers and ^ ^ Pd,i^Pd,ndiPd,i^Pd,nd positive 

integer numbers. 

Question: is there a strongly Q-convex set F around U such that: 

Ph,i < XudP{i) < p'h^i for z = 1, . . . ,nh and h = 1, . . . , d ? 
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Fig. 4. The six ASP around M, with U — {ui,U2,us}, u\ — U2 — — x y 



A Q-convex set F around U whose X-rays are such that: py^i < Xu^F{i) < p'^ ■ 
for all h,i^ is contained in the lattice polygon: 

A = {N \1< Uh{N) <ny\/l<h< d}. 

We use the strategy of the previous section, by replacing the zone Zk{M) 
with Uk{M). Assuming that K = {0, 1, . . . , 2<i — 1}, we associate 2d boolean 
variables W(M), with /c G AT, at every point M in Z\ (i.e., one variable for 
each FLk{M) around M). We build a 2SAT formula APPROX on the variables 
{Vk{M))keK,MeA^ SO that there is a solution F of Problem 2 if and only if 
APPROX is satisfiable. If there is an evaluation V of {Vk{M))keK,MeA satisfy- 
ing APPROX, the corresponding set F solving Problem 2 is F = ^(F), with F 
defined as in the previous section. Conversely, if F is a subset of A solving Prob- 
lem 1, the corresponding evaluation V of {Vk{M))keK,MeA satisfying APPROX 
is F = F{F)^ with F defined as in the previous section. Since every ASP FLk{M) 
is equal to a zone defined by directions Ui,Uj, it is easy to general- 

ize the formulas to contexts having n directions. The generalization SQCONV 
of QCONV is such that SQCONV=2^o A ... A Z 2 d-i, where Zy corresponds to 
FLk{M) (i.e., Z^'^\m)). Thus, we have the following extensions of Lemma 2: 

Lemma 5. 

— For any set F C A the evaluation V = F(F) of the variables 
{Vk{M))keK,MeA satisfies SQCONV. 

— If an evaluation V of the variables {Vk{M))keK,MeA satisfies SQCONV, 
then F = F{V) is strongly Q-eonvex around U. 

A lower bound of I on the X-ray of F along the line uy{M) = i is expressed by 
a 2SAT formula. This formula can be constructed by proceeding as in the two- 
directions case. We do not describe the construction of the formula LB{uhfifi) 
for brevity’s sake. Finally, we fix d—1 pairs of bases for each directions, in order to 
give an upper bound of I on the X-ray of F in the direction uy. The chosen bases 
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correspond to the other d—1 directions. The upper bound is expressed by the for- 
. . . ,Ah-i,Bh-i,Ah+i,Bh+i,- ■ ■ ,Ad,Bd), where Aj,Bj 
are the bases of the direction Uj. This formula is a simple generalization of the 
upper bound formula for the two-directions case. Let (Pi, . . . , Pd, Pd) be an 
instance of Problem 2. We fix 2d bases Pi, . . . ^ Ad, Bd and then we build the 
following boolean formula: 

APPROX(Pi, P', . . . , Prf, P', Al, Pi, . . . , Arf, Bd) = SQCONVA 

^B{uh, i,Pi) A UB(p, Al, Pi, . . . , Ah-i, Bh-i, Ah^i, Bh^i, • • • , Ad, Bd) 

l<h<d 

We deduce that: 

Theorem 2. APPROX(Pi, P{ . . . , Ad, Bd) is satisfiable if and only if there is a 
strongly Q-eonvex set F around U eontaining the bases Ai,Pi, . . . ^Ad,Bd and 
having X-rays along Uh sueh that ph,i < Xu^F{i) < p'^i^ for i = I, . . . ,Uh and 

A = 1, . . . , d. 

The algorithm chooses d pair of bases, and builds the 2SAT expression APPROX. 
Assuming that n = maxjni, . . . , Ud}, we have that each formula APPROX has 
size 0{n^) and can be constructed in 0{n^) time. The number of reconstruction 
attempts is bounded by the number of different positions of the 2d bases, and 
this is at most Consequently: 

Corollary 3. Problem 2 ean be solved in time. 

If Ph = Pd for each 1 < h < d. Problem 2 become the exact consistency problem 
with more than two directions. In this case, we have to choose d—1 pair of bases 
for the upper bound and we complexity of algorithm for solving this problem is 
0{n^^). The convex lattice sets are special Q-convex sets, and so by uniqueness 
results for Q-convex sets proved in [6] , the algorithm can be used for reconstruct- 
ing convex lattice subsets from their exact X-rays in some sets of four suitable 
lattice directions, or in any set of seven prescribed mutually nonparallel lattice 
directions. This means that the two algorithms solve Gritzmann’s problem for 
these special sets of directions. 
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Abstract. In classical linear algebra, the question to know if a vec- 
tor a G belongs to the linear space Vect{v^ ■ ,v^} generated 
by a familly of vectors, is solved by the Gauss pivot. The problem in- 
vestigated in this paper is very close to this classical question: we de- 
note [.Jn the function of defined by l{xi)i<i<n\n = (L^iJ)i<z<n and 
the question is now to determine if a given vector v ^ belongs to 
[Vect{v^ , • • • ,v^}\n- This problem can be easily seen as a sytem of 
inequalities and solved by using linear programming but in some spe- 
cial cases, it can also be seen as a particular geometrical problem and 
solved by using tools of convex geometry. We will see in this framework 
that the question v G [Vect{v^ , ■ ■ ■ generalizes the problem 

of recognition of the finite parts of digital hyperplanes and we will give 
equivalent formulations which allow to solve it efficiently. 

Keywords: chord, convex hull, digital hyperplane, simplex, slice. 



1 Introduction 

The problem that we are going to consider in this paper belongs to a set of 
questions of geometry of numbers, which have been deeply investigated since 
the beginning of the century. The problem which can be considered as the origin 
of the others, is the one which consists in computing if a given vector v G 
belongs to the linear subspace generated by a familly of W^: 

V G Vect{vi/1 < i < k}7 

The method of computation used for solving this question is the very classical 
Gauss pivot. Many interessant problems are close to this easy and basic ques- 
tion: if we consider no more the linear space Vect{vi/1 < i < k} but only the 
additive group Group{vill < i < k} = {Yli=i < i < k^ai ^ Z} gen- 

erated by the vectors 'e*, the problem is a system of diophantine equations. We 
could also be interested in the distance between v and Vect{vi/1 < i < k} or v 
and Group{vill < i < k} instead of the single membership of v to these sets. If 
the choosen distance comes from a defined quadratic form, it leads to classical 
computations but it is not the case if one makes the choice of the distance di 
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associated with the norm Ni{xi)i<i<n = \^i\ distance doo asso- 

ciated with the classical norm A/'oo(^i)i<i<n = max{\xi\/l < i < n}. We can 
mention Kronecker’s approximation theorem which provides a characteriza- 
tion of the vectors v whose distance doo{v,Group{vi/l < i < k}) to the group 
Group{vill < i < k} is stricly less than a given e [S88]. 

The question investigated in this paper is of the same kind: we introduce 
the function [.Jn : ^ defined by [(x^)i<i<njn = (L^d)i<«<n where [rj 

denotes the integer part of a real number r (in other words, the largest integer i 
verifying i < r) and the question is to determine whether a vector v ^ belongs 
to the image of Vect{v^,v ‘^, ... by the map [.J^ : 



The problem of determining if v G [V ect(v^ , v‘^ , . . . does not appear 

in the literature of geometry of numbers and we can give two reasons to explain 
this absence: 

— the first one is that the function integer part has never been considered as 
a serious source of problems in classical mathematics and if it is now the 
case, it is especially a preoccupation of computer scientists for which the 
truncature is a usual practical tool. 

— the second reason comes from the fact that the analytical form of the question 
is quite ordinary. It is simply a system of inequalities: by denoting Vj the 
coordinates of v e and Vj the coordinates of v'^ G the problem is to 
determine whether there exist k real numbers ai, a 2 , ak verifying the 
system of n double inequalities 



for Ij < n. The systems of linear inequalities is of the domain of linear pro- 
gramming and there exists nowadays many different methods to solve them. 
The famoust one is the simplex method [Ga60] and we will come back later on 
its geometrical meaning in order to provide geometrical efficient algorithms. 
A second solution is the classical method of Fourier-Motzkin which con- 
sists in eliminating the variables. A third one is an algorithmic technique 
eliminating the redundant linear inequalities and described in [E87]. There 
exists many others but the aim of this paper is not to present them. Our 
ambition is mainly to bring out the geometry which can be hidden in our 
special kind of systems. It is the reason why we are going to choose a geo- 
metrical point of view. We are however unable to do it in the general case 
and then we have to restict ourself to two special cases: 

• when the rank of the familly is n or n — 1, technics of linear 

algebra allow to conclude, 

• when the constant vector (l)i<i<n belongs to the linear subspace 



V G [Vect{v^,v‘^, . . . ,v^)\n ? 



k 




i=l 
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Vect{v^ ^ ... , the question to know whether v g[V ect{v^ ^ v ‘^ ^ ... , 
generalizes a problem which has been discussed many times in the previous DGCI 
and which consists in recognizing the finite parts contained by naive digital hy- 
perplanes ([DR94,D95,FST96,R95]). We will give a geometrical characterization 
of the solutions by introducing a new tool, the chords’ set of a part of and a 
classical notion, the convex hull. This geometrical characterization provides two 
natural algorithms and as the question generalizes the recognition of the digital 
hyperplanes, they can naturally be used to determine if a finite part belongs to 
a digital hyperplane of double inequality h < < h -j- |an|. 

As conclusion of this introduction, we have to say a word about the applica- 
tions and we can begin with the recognition of the digital hyperplanes. A good 
algorithm of recognition can be the origin of many applications in digital im- 
agery and it can lead to a whole set of geometrical methods of treatment of 
images. We have however to stress the applications of the main problem that we 
have considered because it has an inexhaustible number of numeric applications: 
one of them is the use of what we could call the polynomial approximations of 
the digital functions. We call polynomial approximation of a digital function 
/ : — > Z under a domain T) C Z^ a polynomial P{Xi)i<i<n verifying for all 

X e D, 



f{x) = [P{x)\ . 

These objects provides values {(PP/dxi^.dxi^ dxi^) which can be seen as 

the differentials of the function /. The computation of a polynomial approxima- 
tion of degree less than r for a function / under a domain D supposed ordered 
and of cardinality n consists in determining a linear combination of the vectors 
{l)ieD, {i)ieD, {i^)ieD, ■■■■, {P)ieD whose image by [.J„ is {f{i))ieD- Then it 
is a direct application of the second case that we have planned to investigate. 
We can also approach the function / by other things as polynomials and use it 
to create digital filters or many other things that we have not place enough to 
describe. 



2 Easy Cases 

The general question in which we are interested, consists in determining if a 
given vector u G Z^ belongs to the image of the linear subspace 

Vect{v^/1 < i < k} hy the map [.J^- 

The first easy case is trivial: when the rank of the familly ...., is n, 

the linear subspace Vect{v^ /I < i < k} is equal to the whole space R’^ and as 
[.Jn is sujective, it is clear that the intersection of the reciprocal image of any 
V ^ hy [.J^ with Vect{v'^/1 < i < k} can not be empty. 

The second easy case is not trivial but it is not necesary to use the double in- 
equalities to understand what happens : when the rank of the familly 
is n — 1, the linear subspace Vect{v'^/1 < i < k} generated by these elements can 
be characterized by a linear form ip (then we have Vect{v'^ /I < i < k} = {x ^ 
W^/ip{x) = 0}) . We denote a a point of the set {0, 1}^ where ip is minimum 
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and b a point of {0, where (p is maximum. By denoting the reciprocal 

image of v by we can see that 

]ip{v + a)-ip{v + b)[c V?(Un^(0) C [(p{v + a);ip{v + b)]. 

There are two trivial consequences : 

— if 0 belongs to ](^('c + a); (^('c + 6) [, then 'c belongs to [Vect{v'^ /I < i < k}\n- 

— if 0 does not belong to [(p{v + a); (p{v + 6)], then v does not belong to the set 
[Vect{v'^ /I < i < k}\n^ 

and there remains only one case: the one where one of the two values (f{v-\-a) 
or cp{v + h) is null. In that case, we just have to compute p^{v) \ 

— if p^{v) = 0 then v belongs to Vect{v'^ /I <i<k} and then also to its image 
\Vect{v'^ /I < i < k}\n- 

— if (fi{v) ^ 0 then the boundary 0 is not reached by ^(Un^('^)) follows 

that V does not belong to \ Vect{v'^ /I < i < k}\n. 

As conclusion, it proves that we just have to compute (p{v). If it is null then v 
belongs to [Vect{v^ /I < i < k}\n and otherwise v belongs to [Vect{v^ /I < i < 
k}\n if and only if (p{v) g ] — —(p{a)[. The complexity of this computation 

is very low. It is the sum of 0(n^) for the computation of (p with 0(2^) for 
the computation of a and b, and this global time has nothing to do with the 
complexity of the investigation of linear inequalities. 

3 Geometrical Case 

We are still interested in the question: does a given vector v e belong to the 
image of Vect{v^ /I < i < k} hy the map [.Jn ? 

We suppose now that the constant vector (l)i<j<n belongs to Vect{vyi < 
i < k}. In order to simplify the notations and the computations, we are even 
going to assume that the vectors are free and that = (l)i<j<n. 



3.1 A Geometrical Equivalent Problem 

Before explaining the consequences of the previous hypothesis, let us see what 
happens for the question v G Vect{v'^/1 < i < k}? (for the moment, we forget the 
integer part [.Jn)* The system = v can be written aiVj — Vj = 

—ak for 1 < j < n and it follows that if there exists a solution, the n vec- 
tors {vj ^ ,Vj) containing the j coordinates of and v 

all belong to an affine hyperplane of M^. Conversely, if the n vectors of co- 
ordinates ('cj, ...., 'Cj) belong to an affine hyperplane, there exist real 

coefficients ai, a 2 , ak and a such that YliZi -\- av = — a/c(l)i<j<n- The 
familly ...., = (l)i<j<n was supposed free and it follows that a can 
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not be null (excepting the particular case where r’ = 0). It proves that v can 
be written Yli=i only if there exists an affine hyperplane of con- 
taining the vectors of j coordinates ^ ^Vj). It means that the linear 

question ‘Y G Vect{v^ /I < i < k} ?” has been transformed by using dual- 
lity into a problem of affin geometry. We can do the same with the question 
“u G \ Vect{v'^ /I < i < k}\n ?” with the difference that the part taken by the 
affine hyperplanes is played by the subsets of characterized by double inequa- 
tions of the form h < Yl^=i < ^ + |<^n|- In order to lighten the notations, 
these sets are simply called the slices of : 

Definition 3.1 A slice ofW^ is a subset of MA which can he characterized 
by a double inequation of form h < Ylt=i h + |on| where h, ai, 

are real numbers and where is not null. 

We can already notice that the traces of the slices under are digital 
hyperplanes [R91]. The affine transformation of the problem in which we are 
interested provides the following proposition. 

Proposition 3.1 .— Let v^, v^,.... , v^“^, v^ = (l)i<j<n be a free familly of 
vectors of'MA and let v be an element ofZ^. We have v G [VectjvYl < i < k}Jn 
if and only if the set of the n vectors (v|, v?, . . . , v!^“^, vj) of the j coordinates 
is contained by a slice o/R^. 

Proof. If u G [Vect{v'^ /I < i < k}\n^ it means that the system of n inequalities 

k 

Vj < ^ aiVj < Vj -\-l 
i=l 

for 1 < j < n, has some solutions. We can rewrite it 



k-l 

-dk < ^2 ~ ^ ^ 

i=l 

and it means that the subset of characterized by the double inequation of 
variable x = (xi)i< 2 <n ^ and of expression 



k-l 

dk A ^ ^ d^Xi X]^ d/g “t“ 1 
i=l 

contains the n vectors of the j coordinates for I < j < n. 

By setting h = — a/c, ak = —1 and ai = for i between 1 and /c — 1, we obtain 
a first part of the proposition. 

Now, let us prove the converse. We suppose that there exists a slice of 
characterized by a double inequation h < "^i=i c^iXi < h-\-\ak\ containing the n 
vectors Before anything, we shall begin by showing that 
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ak can be supposed negative. The set of the values taken by the linear form 
for the n vectors ('cj, is finite. Then there exists a 
real value e > 0 such that the points ('cj , , • • • belong to the set 

defined by 

k 

h<'^ aiXi < h + \ak \ - e. 

i=l 

By multiplying by —1, it follows that they also belong to the slice defined by 

k 

—h — \ak\ + e < —OiiXi < —h + e. 

By setting h' = —h — \ak\ ^ and a[ = we see that ak can be supposed 
negative and from now, we assume it. 

We have for any j between 1 and n 

k-l 

h <’^ aiVj + UkVj < h + \ak\- 



It follows that 



k-l 

h/\ak\ < 'Y^{ai/ak).v] - Vj < + 1 - 

i=l 

By setting ak = —h/\ak\ and ai = ai/\ak\ for all i between 1 and /c — 1, we have 

k-l 

-dk < ^ di.v] - Vj < -ak + 1 
i=l 



and it ends the proof. □ 

The proposition 3.1 transforms the question to know if v G [Vect{v'^ /I < i < 
k}\n into a problem which consists in recognizing the finite parts of slices. In 
the particular case of finite parts of this problem has arroused the curiosity 
of several authors in the previous DGCI ([FST96,DR94]) and it is known as the 
problem of recognition of digital hyperplanes. 

3.2 Recognition of the Finite Parts of Slices 

The recognition of the finite parts of slices is a problem of convex geometry, 
which leads to introduce the chords’ notion. 

Definition 3.2 .— Given a part A C the elements a' — a where a G A and 
a' G A are ealled the chords of A. The ehords^set of A is the subset ofMT denoted 
chord(A) and defined by 

chord(A) = {a' — a/a G A, a' G A}. 
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We also introduce the notion of convex hull: the smallest convex part of 
containing a set A is called the convex hull of A and is denoted convRn(A). 

These two operators have several properties as the one to commute: we have 
chord o convRn = convRn o chord. 

Theorem 3.1 There exists a sliee of eontaining a given finite subset 
A gMA if and only if the point u of eoordinates all null exeept the last one equal 
to 1, does not belong to convRn(chord(A)). 

It means that we just have to compute if u = (0,0,... ,0,1) belongs to the 
set convRn {chord{A)) in order to know if there exists a slice containing the finite 
part A. Theorem 3.1 can be proved directly but we are going to use an other 
way based on the elimination of variables according to the method of Fourier- 
Motzkin. 

Proof According to proposition 3.1, there exists a slice containing the finite set 
A = if and only if there exist real numbers h, <ai, 0 ^ 2,. ••5 <^n-i 

satisfying the system (S) of the n double inequalities 

n — 1 

< < Y -h<ai + l 

i=l 

for 1 < j < n. By eliminating the variable h, we obtain the system (S’) 

n — 1 

1 < Y ) + («n - «n) < 1 

i=l 

which according to Fourier-Motzkin, is satisfiable if and only if (S) also is. It 
means that the existence of a slice containing A is equivalent to the existence of a 
linear form p(xi)i<i<n = OiiXi—Xn sending the set chord{A) in ] — 1, 1[. As 

chord(A) is a symmetric set, it is also equivalent to the existence of a linear form 
T{xi)i<i<n = ~Xn Sending the set chord{A) in ]p{u)^ + 00 [. It implies 

that if there exists a slice containing A, then the hyperplane of equation p{x) = h 
where p{u) < h < min{(p{chord{A))) stricly separates chord{A) and the point u. 
The converse is also true: if there exists an hyperplane stricly separating the set 
chord{A) and the point r, we can write its equation p{xi)i<i<n = h or again 
Y17=i -\-anXn = h. The coefficient an can be supposed equal to —1 because 
it can not be null (0 G chord{A) would be on the same hyperplane as u). It follows 
from the separation and from (^(0) > (p{u) (the origin belongs to chord{A)) that 
(f{chord{A)) c]p{u)^ Poo[. Then we obtain a new equivalence : there exists a 
slice containing A if and only if there exists an hyperplane stricly separating 
chord{A) and u. At this step, the theorem of separation of compact sets [V64] 
ends the proof. □ 

Theorem 3.1 gives a geometrical characterization of the finite subsets of ZA 
contained by slices or in other words, by naive digital hyperplanes and at first 
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sight, one could regret that the tools used in the theorem, namely the chords 
and the convex hull, are continues operators. This critic is howewer not founded 
because the chords and the discrete convex hull can also be defined in . By 
denoting them chord and convzn , the property of commutation is preserved: for 
any finite set A C we have chord{convzn){A) = convzn{chord[A)) (it is not 
a trivial result) and it is exactly the trace of the continues set convRn (chord{A)) 
on It proves the discrete corollary of theorem 3.1 : 

Corollary 3.1 .— There exists a digital hyperplane of double inequality 

h < Ylt=i < h + |on| eontaining a finite set A c z/ and only if the 
veetor u = (0, • • • , 0, 1) does not belong to convzn(chord(A)). 

In the framework of digital geometry, corollary 3.1 shows that the recognition 
of finite subsets of slices can be solved by computing the digital convex hull of 
the chords of the set. 

3.3 Algorithms 

Theorem 3.1 provides a constructive characterization of the finite sets of 
contained by slices and it provides different geometrical algorithms to recognize 
them. 

The first one is natural. In order to determine if there exists a slice containing 
the finite part A C we begin by computing its chords’ set. We obtain a set 
and we compute its convex hull, by a given method of algorithmic geometry. It 
only remains to know if the point u belongs to the polytope convRn (chord{A)) 
by determining if there exists a face strictly separating u from the set. If there 
exists one, then its direction is the direction of a slice containing A and otherwise, 
there does not exist any solution. 

This algorithm can be made incrementaly and several technics improve its 
effectiveness. The central point of the algorithm is the computation of the 
convex hull of chord{A). When we add a point x to the set A (we denote 
A' = Au{x}), the chords’ set becomes chord(A') = chord{A)Utranslationx{A)U 
translation-x{A) and its convex hull can be obtained from convRn[chord{A)) 
and convRn(A). We can even notice that if x G convRn(A)^ then chord(A') = 
chord{A). Then an incremental algorithm can be obtained by repeating the fol- 
lowing process : 

We assume that we have computed convRn (A), convRn (chord{A)) and that u 
does not belong to this last set. We add the point x to A and obtain A' = Au{x}. 
If X G convRn{A), then convRn(A') = convRn(A) and convRn{chord{A')) = 
convRn(chord{A)). If x ^ convRn(A), then we can use the central step of 
the incremental Beneath-Beyond method [E87] to compute convRn(A'). We 
also have to compute convRn(chord{A')) which is equal to the convex hull of 
the three convex sets convRn[convRn[chord{A)) U translation-x{convRn [A))U 
translationx{convRn (A))) . An efficient method should be a “wrapping” process 
using the structures of convRn (A) and convRn (chord{A)) but as we are in dimen- 
sion n, it is not easy. We are then reduced to use the Beneath-Beyond method to 
obtain convRn{chord{A')) without expoiting the convex structure of convRn(A). 
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This algorithm and its incremental version are natural but they are not op- 
timal. We can wait much better because the structure of the convex polytope 
convRn(^chord{A)) (equal to chord{convRn (^A))) seems able to be directly con- 
structed from the structure of the convex polytope convRn(A). It is however not 
yet comp let ly clear. 

After this really constructive algorithm, let us present an algorithm coming 
from the simplex method. There are many different ways of thinking this famous 
method, and we are going to choose the one which involves the simplexes. Given 
a finite subset S of the question to determine if a point y G belongs to the 
convex hull of S can be solved in searching a simplex whose vertices belong to S 
and which contains y but instead of going all over the set of all the simplexes, 
the simplex method is based on the following strategy: 

Let us assume that the smallest affine space containing S is MA because 
we can also come back to this case, and let us take a n-simplex whose ver- 
tices belong to S. If it contains y, then y is in the convex hull 

of S and it ends the computation, otherwise almost one of the vertices in on 
the opposite side of y towards the face which does not contain it. It provides a 
linear form Lp which is negative in null in the other vertices and positive in y. 
Then we compute the maximum of (^9 in *5^. If it is null, y does not belong to the 
convex hull of S and otherwise, the vertices x^,... , 

where x'^ maximizes in 5 define our new simplex. We continue in this way 
until finding a null maximum or a simplex containing y. Problems of cycles can 
sometimes happen but a simple labelling of all the simplexes already met allow 
to avoid it. 

In our problem, we have to determine if the point u belongs to the convex 
hull of chord{A) and an easy solution is to use the simplexe method. The points 
used in the algorithm are, except at the beginning, the one which maximize 
linear forms and we can notice that given a linear form (p and a finite set A, 
a point maximizing (p in chord{A) is the difference x — x' between a point x 
maximizing (p in A and x' minimizing (p in A. It means that we do not need to 
compute the whole set chord(A) to obtain its maximum by a linear form: with 
the simplex method, we do not need to compute the chords’ set chord(A). It 
makes the algorithm very simple and easy to make incremental. 

The complexities of the two previous algorithms are not easy to evaluate 
and we are only going to increase them by rough boundaries. By denoting a 
the cardinality of the set A and n the dimension, the complexity of the first 
algorithm is 0 (a^L(^+i)/ 2 J ) Q^j^d the fact to make it incremental does not modify 
it. The complexity of the simplex method is the product between 
and the sum of 0{a) with the complexity of the integer computation of the 
inverse of a matrix of size n. This boundary is however excessive because it 
corresponds to the completly impracticable case where all the n-simplexes have 
to be investigated. We have to compare these values with the complexities of the 
classical methods solving systems of linear inequalities. The one of the Fourier- 
Motzkin algorithm is near from 0{a^ ) and the one of the method described 
by Edelsbrunner in [E87] is linear in a but still doubling exponential in n. 
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The complexities of the two geometrical algorithms that we have suggested, are 
better in n but not in a. It remains however probable that in practice, the best 
of all should be the simplest one, namely the algorithm based on the simplex 
method. 

4 Conclusion 

The aim of this paper was to bring out the algebra and the geometry hidden in 
the question “does v belong to the image of a given linear subspace H of by 
the map [.J n ?” but we were not able to do it in the general case. Then we have 
placed us in particular frameworks and we have obtained the following results: 

- If the dimension of AT is n — 1 or n, then classical tools of linear algebra 
allow to conclude. 

- If (l)i< 2 <n G TT, then the problem is equivalent to the one which consists 
in determining if there exists a slice containing a given finite part of 

and it can be solved by using the chords and the convex hull. This geometrical 
translation of the problem can not be generalized to any case. It is specific to 
this one because the vector (l)i<i<n plays a particular role being a kind of step 
of the function [.J^. Then the hypothesis is important but in practical, it is not 
too restrictive. We can as example notice that the problem which consists in 
computing polynomial approximations belongs to this framework, and then can 
be solved by the algorithms described at the end. 

Generally speaking, the problem of determining if v belongs to [H\n has an 
inexhaustible number of applications and its practical resolution would be a step 
in the development of what we could call a real digital technology. 
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Abstract. A generalization of a classical discrete tomography problem 
is considered: Reconstruct binary matrices from their absorbed row and 
columns sums, i.e., when some known absorption is supposed. It is math- 
ematically interesting when the absorbed projection of a matrix element 
is the same as the absorbed projection of the next two consecutive el- 
ements together. We show that, in this special case, the non- uniquely 
determined matrices contain a certain configuration of Os and Is, called 
alternatively corner-connected components. Furthermore, such matrices 
can be transformed into each other by switchings the Os and Is of these 
components. 



1 Introduction 

Consider a ray (such as light or X-ray) transmitting through homogeneous ma- 
terial. It is a well-known physical phenomenon that a part of the ray will be 
absorbed in the material. Quantitatively, let /q and I denote the initial and the 
detected intensities of the ray. Then 



/ = /o • , (1) 

where /i > 0 denotes the absorption coefficient of the material and x is the length 
of the path of the ray in the material. 

Consider now the 2-dimensional integer lattice Z^. Let m and n be positive 
integers. A discrete rectangle with size m x n is a special discrete set of I? 
determined as the intersection of m consecutive horizontal lattice lines with n 
consecutive vertical lattice lines. If F is a discrete set in Z^, then there is an m x n 
discrete rectangle containing F, it is called containing (discrete) rectangle. (For 
the sake of simplicity we take the smallest containing rectangle in the following.) 

Let us suppose that the horizontal and vertical projections of F are measured 
by detectors placed in the next column to left and in the next row to above, 
respectively, of the containing rectangle (see Fig. l.a). 

Then the projections can be computed according to (1). For example, in the 
case of the discrete set F given in Fig. la, the projections along the horizontal 
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a) b) 

Fig. 1. A 2D discrete set F, the corresponding binary matrix A, and the horizon- 
tal and vertical projections, (a) A 2D discrete set F (its elements are indicated 
by points) and the detectors measuring its horizontal and vertical projections, 
(b) The binary matrix A and its row and column sums 



lattice lines of the containing rectangle are ri = + V 2 = + 

and rs = e~^'^ + e~^'^ (from top to down). 

Let us use an equivalent representation of the 2D discrete sets and their 
horizontal and vertical projections. The containing rectangle including F can be 
represented by a binary matrix A = {aij)mxn as follows (see Fig. l.b): aij = 1 
if the lattice point corresponding to (i,j) is an element of F, aij = 0 otherwise. 
In order to use the generally accepted notation of numeration systems [5] let us 
introduce 



/3 = . ( 2 ) 

Clearly, /3 > 1, because /i > 0. Then we can define the absorbed row and column 
sums of A, Rj 3 {A) and S^{A)^ respectively, as 



F/ 3 (A) = (ri, . . . ,r^) , where ri = ^aijf3^, i = l,...,m, (3) 

j=i 

and 



5/3(A) = (si,...,Sn) , where sj =^aijfd \ j = l,...,n. (4) 

For example, in the case of matrix A given in Fig. l.b) Rp{A) = {f3~^ + 
/^-3, ^-1+^-4) and5^(A) = (/3-i+/3-3, F 

Then the reconstruction problem of binary matrices with absorption knowing 
the projections along horizontal and vertical lines can be posed as 
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Reconstruction DA2D{f3). 

Given: (3 > 1, m, n, R e and S 

Task: Construct a binary matrix A with size m x n such that 



Rp{A) = R and Sp{A) = S . (5) 

We say that a binary matrix A is a solution of the DA2D{f3) reconstruction 
problem if (5) is satisfied. 

If /3 = 1 then we have the classical reconstruction problem of binary matrices 
without absorption (as summaries see e.g. [1,3])- Select, now, a mathematically 
interesting case when (3 = (3q where 



/^o ^ — /^o ^ + /^o ^ 



(6) 



giving 



/^o — 



1 + a/5 
2 



(7) 



(The other solution of (6), namely (1 — >/5)/2, is not applicable in this case, 
see (2).) 

In this paper we discuss the problem of reconstruction of binary matrices 
from their row and column sums in the case of absorption corresponding to f3o. 
Because of the page limit, the proofs of theorems and lemmas are omitted here 
(but they are given in [4]). 



2 /3o-Representation 

Consider the absorbed row and column sums of the binary matrix A in the case 
of (3 = f3o: 



n m 

ri = Ckjfio^ , i = 1, . . . , TO, and sj = , j = 

j=l i=l 

(8) 

Using the terminology of numeration systems we can say that the finite (bi- 
nary) word an'-ain is a (finite) representation in base f3o (or a finite f3o~ 
representation) of for each i = 1, . . . , m, and, similarly, a\j • • • Umj is a /5o- 
representation of Sj for each j = The equations (8) mean also that 

the row and column sums of A are nonnegative real numbers having a finite /^o- 
representation with n and m binary digits, respectively (including the eventually 
ending zeros). 
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Let Bk denote the set of nonnegative real numbers having a /3o-representation 
with k binary digits {k > 1), formally, 

k 

Bk = e {0,1}} • (9) 

2=1 

Then 

Vi G Bm i = 1, . . . , m, and Sj G Bm^ j = 1, . . . , n, (10) 
are necessary conditions for the existence of a matrix A with 

RpoiA) = {ri,...,rm) and 5/3g(^) = (si, . . . , s„) . (11) 

2.1 Switching in /3o-Representations 

The /3o-representation is generally non-unique, because there are binary words 
with the same length representing the same number. For example, on the base 
of (6) it is easy to check the following equality between the 3-digit-length /5 q- 
representations 



100 = on . (12) 

Furthermore, Equality (12) may allow us to generate newer /^o-representa- 
tions: If in a finite length /5o-representation there is one of the sub- words 100 
or Oil, then it can be replaced by the other one. This transformation of binary 
words is called (ID) elementary switehing. 

It is clear that one or more ID elementary switchings give(s) new /5o-represen- 
tation(s) of the same number. As an example, see the 5-digit-length /3 q - represen- 
tations of 1/ f3o: 



10000 = 01100 = 01011 , (13) 

where the second /^o-representation is created from the first representation by 
switching on the first three positions and the third representation is created from 
the second one by switching on the last three positions. 

As direct consequences of (12), it is easy to see that 



10x00 = 01x11 
10x0x00 = 01x1x11 
10x0x0x00 = 01x1x1x11 



( 14 ) 
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where x denotes the positions where both /3o-representations have the same 
binary digit. (That is, such kind of transformation l(0x)^00 ^ 0(lx)^ll {k > 
0) between the sub-words of the /3o-representations can be performed without 
changing the represented value and without changing the values in the positions 
indicated by x.) The transformations described by (12) and (14) are called (ID) 
switchings. 

Now, we are going to prove that any finite /5o-representation of a number can 
be get from its any other /3o-representation by switchings. 

Lemma 1. Let a\ - • • and bi — -bk be different, k- digit-length Pq - representa- 
tions of the same number. Then 6i • • • 6/^ can be get from ai • • • by a finite 
number of ID switchings. 

Proof. A procedure by which the suitable switchings can be found is described 
in [4], 

Consequence, li - • • Ok and b\- — bk are different, /c-digit-length /^o-representa- 
tions of the same number, then there are positions i,i l,i P 2 (1 <i < k — 2) 
such that there is a switching between - • • ak and b\- • - bk on these positions. 

3 2D Switchings 

In this section we determine the 2D switchings, i.e., the transformations of binary 
matrices by which some of their Os and Is can be switched to each other such 
that the absorbed row and column sums remain the same. 

3.1 Connectedness 

Consider the class of m x n (m,n > 3) binary matrices with given row and 
column sum vectors in the case of absorption corresponding to (3q. Let 
{1 < i < m,l < j < n) denote the 3x3 discrete square 

%,i) = {*-1,*,* + !} X {j-l,j,j + l} . 

Let be a set of 3 X 3 discrete squares of {1, . . . , m} x {1, . . . , n} and let 
G U. Two kinds of connectedness will be defined on the set of 3 x 3 
discrete squares: side-connectedness and corner-connectedness. There is side- 
connection between and if {i'^j') G {{i — 2,j),{i,j — 2),{i,j-\-2),{iP 

2,j)}. The squares being side-connected to are called the side-neighbors 

of S(^ijy As an illustration see Fig. 2. a. The squares and S(^i > are corner- 
connected if {ifj') G {(i — 2, j — 2), (z — 2, j + 2), (i -h 2, j — 2), (i + 2, j + 2)}. The 
squares being corner-connected to are called the corner- neighbors of 

(see Fig. 2.b). 

There is a side-chain between and in E if a sequence of elements 

of E can be selected such that the first element is and the last element 

is and any two consecutive elements of the sequence are side-connected. 

(A sequence consisting of only one square is a side-chain by definition.) 
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'^ij) 



■XW 



a) b) 

Fig. 2. The discrete square and a) its side-neighbors and b) its corner- 

neighbors 



The set E is side- connected if there is a side-chain in E between its any 
two different elements (see Fig. 3. a). A side-connected set E is strongly side- 
connected if whenever ^ ^ ^ ^ ® then they are 

side-connected or they have a common side-connected neighbor (see Fig. 3.b). A 
maximal strongly side-connected subset of E is called a strongly side- connected 
component of E. 




a) b) 

Fig. 3. Side-connected sets of discrete squares. The set a) is not strongly side 
connected because ^ 0 but they are not side-connected and they 

have no common side- neighbor, b) A strongly side-connected set 



Let cr be a set of strongly side-connected components E^^\ . . . , E^^^ (A; > 1). 
Let E^^\ E^^ ^ G cr (1 </,/'< /c). E^^^ and E^^ ^ are corner- connected if whenever 
S G E^^^ and S' G E^^ ^ have a common position then S and S' are corner- 
connected squares. (Since E^^^ and E^^ ^ are maximal, S and S' cannot be side- 
connected squares.) There is a corner- chain between E^^^ and E^^ ^ m cr if a 
sequence of elements of cr can be selected such that the first element is 
and the last element is E^^ ^ and any two consecutive elements of the sequence 
are corner-connected. (A sequence consisting of only one component is a corner- 
chain by definition.) The set cr is corner- connected if there is a corner-chain in 
cr between its any two elements (see Fig. 4. a). 
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Fig. 4. Two corner-connected sets, both of them have two strongly side- 
connected components 



3.2 Switching Patterns 

In order to identify not necessarily rectangular parts of binary matrices, we 
introduce the concept of binary patterns (or shortly, patterns) as binary valued 
functions defined on an arbitrary non-empty subset of {1, . . . , m} x {1, . . . , n}. 
(In this terminology binary matrices are binary patterns on discrete rectangles.) 

Let P be a binary pattern, its domain will be denoted by dom(P). The 
absorbed row and column sums of P are denoted by Rpq{P) and 5/3 q(P), respec- 
tively, where the Rh component of (P) is 

E 

(i,i)edom(P) 

for 1 < i < m and the jth component of Spq{P) is 



E 

(2y)edom(P) 



for 1 < j < n. 

Let us define the switching pair of P, P', as 






on dom(P). We say that P is a switehing pattern if 



Rp,{P) = Rp,{P') and Sp.iP) = Sp.iP') , 

and P and P' are a switehing pair. That is, the patterns of a switching pair can 
be get from each other by switching their Os and Is and still they have the same 
absorbed row and column sums. 
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Consider the following binary patterns: 

Oil 100 

= 10 0 and =011, 

100 oil 

both of them are defined on the discrete square . It is easy to check that 
and are a switching pair. They play an important role in the generation of 

switching patterns, and are called the 0-type and 1-type elementary 

(2D) switehing pattern^ respectively. 

3.3 Composition of Patterns 

The eomposition of two patterns P and P' is the function 

P ^ P' : dom(P) A dom(P') — > {0, 1} 

(A denotes the symmetric difference) such that 



G dom(P) \dom(P'). 

j), if(^,i) G dom(P') \ dom(P) . 

(That is, P * P' is undefined on dom(P) n dom(P').) 

For example. 



E 



( 0 ) 

(bj) 



* E 



( 0 ) 

(«+2,j) 



0 1 1 

1 0 0 
rf* rr* rp 

Ju Ju JU ^ 

1 0 0 
1 0 0 



(15) 



defined on A5(^+2,j) = {^ — lAA + 2, z + 3} x {j — 1, j, j + 1}. (Just for the 
sake of simple presentation, on the right side of (15) the whole sub- matrix on 
the rectangle {z — 1, . . . , z + 3} x {j — 1, j, j + 1} is indicated and x denotes the 
positions in the sub- matrix where the composition is undefined.) Similarly, 



0 1x11 

= 10 ^ 00 . ( 16 ) 

10x00 

defined on A5(^j+2) = — lAA + 1} x + j-b3}. It is easy to see 

that E^^\^ * p [^^^2 j) ^0 i) * j+ 2 ) switching patterns, their switching 
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pairs are 



(«+2,j; 



respectively, where 



E, 



( 1 ) 



* E 



( 1 ) 



(i,j) ^(i+2,i) 



1 0 0 
0 1 1 




10x00 


XXX 


and 


e\]\. * = 0 1x11 


0 1 1 




0 1x11 


0 1 1 







(17) 



These examples show how new switching patterns can be created from ele- 
mentary switching patterns by composition. Now, we show a general way how 
two (not only elementary) switching patterns can be used to generate another 
switching pattern. 



3.4 Composition of Elementary Switching Patterns 

Let S = {El , . . . , Ek} {k > 1) be a set of elementary switching patterns of the 
same type on a strongly side-connected set E = {5i, . . . , 5/^}. The composition 
of on Z’, denoted by C{S), is defined as follows. If k = 1 then C{8) = Ei. If 
A; > 1 then let us suppose that Si, ... ,Sk are indexed such that for each /(< 
k) . . . ,Si} is a strongly side-connected set and one of its squares is side- 
connected with Si-^i. (It is easy to see that such an indexing exists.) Then let 



C = C{8) = ((Zi *^2) *•••)* • 

Now, we are going to show that this definition is independent from the indexing 
of Z = [Si, . . . , Sk}- There are four cases depending on how many times the 
position {i,j) is in the sets [Si, . . . , Sk}- 

(i) If there is exactly one I such that (i,j) G Si, then 

C{i,j) = , 

where denotes the value of Ei in the position (i,j). 

(ii) If there are exactly two different li and I 2 such that (i^j) G and 
{i,j) G 5/2, then C{i,j) is undefined. 

(hi) If there are exactly three different /i,/ 2, and /s such that (z, j) G 5/^ n 
5/2 n 5/3, then two of the elementary switching patterns (say, Z/^ and Z/3) 
has the same value in the position {i^j) and the other one (Z/2) has a 
different value here, i.e., = 1 — In this case 

C{i,j) = e|y = e|y . 

(iv) If there are exactly four different and /4 such that {i^j) G 5/^ H 

5/2 n 5/3 n 5/4, then C{i,j) is undefined. 

That is, the value of C{i,j) can be decided simply on the base of the parity 
of the number of discrete squares of Z covering {i,j) (independently from the 
indexing of Z). Accordingly, if 8 is the set of 2D elementary switching patterns 
of the same type on a strongly side-connected set, then we can simply write C{8) 
to denote the compositions of the elements of 5. As an example, see Fig. 5. 



146 Attila Kuba and Maurice Nivat 



- 


PU 

lio 

Co 


X 

X 


OiO 


1 1 

— ^ - -1- 
1 1 


0 


X 


0 


ill 


1 1 

-1 - + - 
1 1 


1 |0 


X 


O'O 

0I0 


X 1 X 


1 


1 1 

1 1 
1 1 

-1 - i- - 


_i_Iq|o 

1 'O'O 


J L 

I I 

^ - 1- 



Fig. 5. The composition of the elementary switching patterns ^( 24)5 

E^^^y, ^(4 6)’ ^(6 4 ) denotes ’don‘t care’ position) 



Lemma 2. Let 8 be a set of elementary switehing patterns of the same type on 
a strongly side-eonneeted set of 3x3 squares. Then C(8) is a switehing pattern. 

Proof. The proof is given in [4] . 

Henceforth, the switching patterns constructed from elementary switching 
patterns of the same type by composition are called eomposite switehing pat- 
terns. We say that the composite switching pattern has 0-type/ 1-type if it is 
the composition of 0-type/l-type elementary switching patterns. For example, 
(15)-(17) and Fig. 5 show composite switching patterns of types 0, 0, 1, 1, and 0. 



3.5 Composition of Corner-Connected Components 

Lemma 3. Let and he earner- eonneeted eomponents. Let and 
be 0-type and 1-type eomposite switehing patterns on and on 

respeetively. Then is a switehing pattern. 

Proof. The proof is given in [4] . 

Let cr be a corner-connected set of strongly side-connected components 

{k > l).Let be 0 - or 1 -type composite switching patterns 

on respectively. Let us suppose also that if 17^) and Z’d ) are 

corner-connected then and ) have different type. In this case we say 
that 7 = {C^^\ . . . , is a set of alternatively earner- eonneeted eomponents 

on a. 

The composition of 7 on cr, denoted by ^( 7 ), is defined as follows. If A; = 1 
then C{'j) = If k > 1 then let us suppose that E^^\ . . . , E^^^ are indexed 

such that for each I (I < k) . . . , E^^^} is a corner-connected set and one 

of its elements is corner-connected with Z’d+i). (It is easy to see that such an 
indexing exists.) Then let 



C = F(7) = _ 
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It is easy to show that this definition is independent from the indexing of cr, 
because dom(C) = \ H and C{i^j) = where 

I G {l,...,/c}is the only index such that (z, j) G Henceforth, we may simply 
write C{j) to denote the composition of the elements of 7 . As an example, see 
Fig. 6 . 
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Fig. 6. Composition of two alternatively corner-connected components (the two 
components are indicated with thick lines) 



Theorem 1. Let 7 = {C^^\ . . . , {k > 1) be a set of alternatively eorner- 

eonneeted eomponents. Then C{'j) is a switehing pattern. 

Proof. It follows from Lemma 3 directly. □ 

Theorem 2. If a binary matrix has a switehing pattern C then there is a finite 
number of sets 71 , ... , 7 / (/ > 1 ) of alternatively earner- eonneeted eomponents 
sueh that C is the eomposition of 

Proof. The proof is given in [4] . 

Therefore, on the base of Theorems 1 and 2, it is proven that the 2D switching 
patterns are just the patterns created as the composition of alternatively corner- 
connected components. 

Finally, two consequences are mentioned (c.f. [ 6 ]): 

(i) A binary matrix is uniquely determined by its absorbed row and column 
sums if and only if it has no sub-pattern created by composition from a 
set of alternatively corner-connected components. 

(ii) If A and B are different binary matrices with the same absorbed row 
and column sums then A is transformable into H by a finte number of 
switchings of switching patterns created by compositions from sets of 
alternatively corner-connected components. 
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Abstract. Many mathematical results exist about continuous topolog- 
ical surfaces of negative curvature. We give here some properties of dis- 
crete regular tessellations on such objects and explain a characterization 
of discrete geodesics and areas that shows how such hyperbolic networks 
can be seen as intermediary structures between Fuclidean infinite tessel- 
lations (like square grid) and regular infinite trees. 

We do not use some possible group structures of this networks (Cayley 
graphs) but only geometrical arguments in our constructive proofs. Hence 
we can see that there are few geodesics in hyperbolic networks and that 
large areas have very unsmooth borders. 



1 Introduction 

A lot of interesting and deep results (see for example [1], [4] and the proceedings 
of the previous DGCI conferences) have been found about the topology of the 
three classical regulars tessellations of the Fuclidean plane, (with squares, equi- 
lateral triangles or hexagons). These tessellations are seen as discrete versions of 
the plane, they are often called Fuclidean discrete planes. In computer imagery, 
each cell of such a tessellation is viewed as a pixel of a fiat screen. 

At the opposite, there are very few results about regular tessellations of 
hyperbolic planes, even though these tessellations also are very interesting : to 
be more explicit, if a screen were a part of an hyperboloid, then it can canonically 
be divided in ” hyperbolic pixels” , which are cells of a tessellation of a hyperbolic 
plane. 

In this paper, we are focused on geodesic lines of these tessellations. We have 
two main results, which show large structural differences between hyperbolic 
discrete planes and the Euclidean discrete plane. The proofs of these results are 
based on precise isoperimetric inequalities in hyperbolic planes (we also give a 
general isoperimetric inequality). 

Our first result describes the set of geodesic lines between two fixed points. 
We prove that this set is ”thin” : precisely, there exists a positive constant c such 
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that if tti, tt 2 , a' and a” are points such that d(ai,a') + d{a' ^a 2 ) = d{ai^a") + 
d{a"^a 2 ) = d{ai^a 2 ) and then d{a',a") < c. 

Our second result proves that for each 3-tuple (ai,a 2 ,a 3 ) of points of the 
hyperbolic plane, the triangle (ai,a 2 ,a 3 ) is (nearly) empty. This result can be 
seen as a discrete version of the classical result about triangles of a continuous 
hyperbolic plane [3]. 

2 Definitions 

A graph is a set G = (V, with V a set of vertices, and E a symmetric subset 
of V‘^ , whose elements are called “edges” . 
a^b G V are called neighbors if (a, b) G E. 
a G V has a degree n G N if a has n differents neighbors. 

A path c of a graph is a sequence of vertices, (r’l, r’ 2 : •••, such that for 
all z, Vi and are neighbors. 

A planar graph is a graph with a planar representation (no crossing edges). 
A cycle is a path such that v\ = Vn- Given a planar representation of the 
graph, a cycle c cuts the set of vertices into three subsets : /nt(c), the finite set 
of vertices inside the cycle, c, the vertices of the cycle, and Ext{c), the set of 
vertices outside the cycle. 

A cell is a cycle with interior void and no edge inside. 

Definition 1. A network E{k,d) is the unique planar graph, with every eell of 
size k and with every vertex of degree d whose planar representation does not 
eontain an aeeumulation point. 

So, T(4,4) is the infinite grid isomorphic to 7?. 

Definition 2. A geodesie xy is a path from the vertex x to the vertex y sueh 
that there is no strietly shorter path. 

For all E{k, d), we note ^ = dk — 2d — 2k = {k — 2){d — 2) — 4 the eurvature 
of the network. If 7 > 0 then the network is infinite. If 7 = 0, then the network 
is called Euclidean, if 7 > 1 then the network is hyperbolic. Now, we will only 
consider hyperbolic networks E(k,d) (see [2] for details). 

For a path, or a set of vertices c, we note |c| the number of vertices it contains. 

3 Fundamental Properties 

First, some basic geometric properties, from Euler Formula : 

Lemma 1. A eompletely triangulated planar graph with n vertiees on the border 
and m interior vertiees is eomposed ofn-\-2m — 2 triangles (eells of size 3), and 
2n + 3m — 3 edges. 

Then, from the previous Lemma, we can deduce the more general lemma : 
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Lemma 2 . A planar graph whose every eell is of size k (with every vertex into 
some eell, no vertex of degree 1 ), with n vertiees on the border and m interior 
{n + 2m 



vertiees is eomposed of - 
2m — 2 



(fc-2) 



2) ,, , (^^ + 2m 

cells and 



2) k n 

{k-2) ^2 + 2 =''^- 



1 + 



k-2 



edges. 



Proof. We just triangulate cells : there are k — 2 triangles in a cell of size k, 
and the triangulation needs A; — 3 additional edges for each cell. 



Theorem 1 (Isoperimetric inequality). For a eyele c of a F{k, d), we have : 

- {k - 1 ) 7 ' 



Proof. We have to consider the planar subgraph G = {V, E) of a network F{k, d) 
such that V = c U Int(c) and E is the set of all edges inside c or in c. So G is 
composed of cells of size k. 

G has at least 2 leaving arc for each border vertex plus one every A: — 1 
border vertices (if we consider the border larger than a cell) to close the cells 
and d leaving edges for each interior vertex. So we can deduce the following 
inequality (using lemma 2, and m = \Int{c)\ et n = |c|) : 



in + 2m — 2) md 1 

n + ,n-l+ >— + „+. 



k - 1 



As the border is a cycle, we have to obviously round up the fractional part 
(if we note p the number of leaving edges, it verifies p{k — 1) > n so p > -^^)- 

(n + 2m — 2) ^ ^ m{d — 2) ^ n 



(.k-2) 



2.{k - 1 ) 



/ ^ /7 m (7 + 4) n n 

(n + 2m-2)-{k-2)> ^ 



n — 2A; + 



A;-l 



> mj 



k{n — 2{k — 1 )) > mj{k — 1 ) 

□ 

Hence, we obtain a linear upper bound of area by perimeter, that is a spe- 
cific property of hyperbolic networks, whereas area is quadratically majored by 
perimeter in Euclidean networks. We remark that the curvature of the plane 
naturally comes into the inequality. Areas are as little as curvature is great: d 
alone doesn’t make any differences, and k is of little importance in the factor 
and in the constant 2 (A; — 1). 
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Fig. 1. Orientation of cells for /c = 6 and k = 7 



Lemma 3. Every cell is correctly oriented according to the distance from a given 
vertex. If k is even, there is one closest point and one farvest point, if k is odd, 
there is one closest point or one farvest point and ^‘a transversal edge^’ : one 
edge that links two edges at the same distance (see figure 1). 

Proof By induction on the building of the network from the given vertex (fig- 
ure 6). 

So, for any vertex and any cell (with k even), there is only one vertex on the 
cell that minimize the distance with the given vertex. Informally, this property 
could be seen as a topological uniformity of the metric space. We must note that 
this property is still true on Euclidean networks, but is false on finite network 
(when 7 < 0). 

4 Characterization of Geodesics 

Given two points x and ^ of a network E(k,d), with distance A from each other, 
we consider the subgraph formed by all vertices on any geodesic xy and the 
edges of this geodesic (There are no transversal edges). 

We must note that, for any point z of a geodesic xy, the path composed by 
two arbitrary geodesics xz and zy is a geodesic xy. 

Lemma 4. The graph of geodesics xy is a planar subgraph delimited by two 
exterior geodesics. 

Proof. (By contradiction) We consider both half-borders of the subgraph that 
links X to y (it is possible because x and y are compulsory on the border, due 
to the fact that we consider only infinite surfaces). If one of this path is not a 
geodesic, then there exists an oriented arc (a, b) of this path that is a part of a 
geodesic yx and not a xy one. But, as a and b are on the border, we note that, 
due to planarity (see figure 2), this geodesic yx must cross itself : it must pass 
trough the same point twice. It is obviously not possible for a geodesic. □ 

The graph of geodesics is planar (as a subgraph of a planar graph), and due 
to lemma 3, the graph of geodesics is only composed of cell of size k if k is even, 
and 2k — 2 if k is odd. 



Some Properties of Hyperbolic Networks 153 



a b 




Fig. 2. The impossible geodesic 



Theorem 2 (Theorem of neighbor geodesics). Given two vertices x and y 
of a network r{k,d), and two geodesics xy (that we note u and v). Then, 
Int(uv~^) is empty. 

Proof. We first consider k as an even number. We use the same method as for 
Theorem 1 of isoperimetric inequality. We consider that the cycle (border of the 
area) is c = uv~^ , and we define the subgraph G of border c. We suppose (with 
any loss of generality) that u and v are disjoint (they share no vertex). 

G has at least 2 leaving edges for each border vertex and d leaving edges for 
each interior vertex. Additionally, if we consider k — 1 consecutive vertices on u 
(or v), we remark the necessary presence of at least 2 supplementary leaving 
edges, because we have to respect the fact that u (and v) is a geodesic and that 
cells are of size /c, as we can see from figure 3 : u can follow the boundary of a 
cell during at most the half of the perimeter. 



X Y X Y X Y 




Fig. 3. Impossible to add only one leaving arc on k — 1 consecutive vertices with 
the necessity of having a geodesic (between X and Y for A; = 6 here) 



So, we can deduce the following formula (using Lemma 2 and m = Int{c) 
and n = |c|). We note that here, the fractional part has to be rounded down, as 
the proposition “at least 2 leaving arc for k-1 vertices” is true on two paths u 
and V (but not on the entire cycle). (We remark that ~ 1) • 



(fc - 2) ^ 2 2 - 



md 

— + n + 



n + 1 
k-1 



- 1 



that we can simplify (as for Theorem 1) : 



2n — 6k 6 > mj{k — 1) 
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Now, we will prove that m is not large compare to n/2. Using that fact, we 
will split our problem into two smaller ones and with the property 
m = m' m" . To solve the entire problem, we will split every subproblem until 
they are trivial (the graph of geodesics is reduced to a single cell where m = 0) 
which implies that m = 0 for the initial problem. 

The trivial cases are n = k and n = 2k — 2 where obviously m = 0. 

If we consider m > n/2 — k, we can deduce by using the last inequality : 

2n — 6/c + 6 > mj{k — 1) > 
that we can simplify into : 

(n — 2k){'y{k — 1) — 4) < 12 — 4A; 

As A; > 4, we remark that {j{k — 1) — 4) > 0, so we have n < 2/c for m > 0 ! 
So m < n/2 — k. 

In every graph of geodesics of border c = uv~^, with u and v disjoint, m < 
n/2 — k means that for some 6 such that k/2 < S < d{x^y) — /c/2, there is 
no interior vertex /i such that d{x,y) = 6 (because there are n/2 — k P 1 such 
different 6, and not as many interior points). So there are only two vertices a 
and b with distance 6 from x, one on u and the other on v. Moreover, there is no 
edge ab (it would be a transversal edge !) and no edge between a and b because 
such an edge would link a vertex with distance d < 5 to another vertex with 
distance d > 5. Then, a et b are on a cell of the graph of geodesics : There is a 
cell F which contains a and b (see figure 4. As /c/2 < (i, P does not contain x, 
and as < d(x, y) — k/2, F does not contain y. 




Fig. 4. A cell in the middle 



We can now cut this problem into two strictly smaller ones, by using lemma 3 : 
one that consider the graph of geodesics xFx where Fx is the vertex of F farvest 
from X and the graph of geodesics yFy. 

We must remark that any geodesic xy can be obtained by concatenation of 
two geodesics xa and ay, or xb and by, due to the fact that a and b are the only 
vertices v that verify {d{x,v) = S;d{x,y) = d{x,v) + d{v,y)}. This is still true 
for any geodesic xFx or yFy. So the graph of geodesics xy is actually “split” into 
two smaller ones xFx and yFy. 
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Fig. 5. Two smaller problems 



The interior vertices of the initial problem will be interior vertices of exactly 
one of the two smaller problems, that we can cut until they are reduce to a very 
small trivial problem (c is one cell or two) where m = 0. 

So m of the initial problem is a sum of interior of subproblems that all are 
equal to zero. This proves the theorem for k even. 

If k is an odd number, we just remark that graph of geodesics have no 
transversal edges, by definition. As every cell in such a T(2i + 1, d) has a transver- 
sal edge, we remark that every cell in a graph of geodesic has size 2k — 2. Remov- 
ing transversal edges reduces the degree of vertices, but we note that for /c = 3 
there are at most two transversal leaving edges for a vertex, and for /c > 4 there 
is at most one. 

So the proof is the same for T(3,d) as for T(4,d — 2), and the same for 
r(2j + 5, d) as for T(4j + 8, d — 1). For d = 3, we have to rearrange the initial 
proof but the result is the same. □ 

Corollary 1. Let x, y he two points of a r{k, d) and 0 < < d(x, y). There are 

at most two points with distanee S from x and distanee d(x, y) — S from y. 

Hence, the set of geodesics between two points is a “narrow” graph (of a 
width of at most one cell). We must remark that, in every hyperbolic network 
r(k^d) we can find two points as far from each other as we want such that there 
is only one unique geodesic between them and we can find, in the same way, two 
points with two disjoint geodesics. 

5 Triangle Inequality 

In this section, we not only show that the metric space of an hyperbolic network is 
hyperbolic (it is a known result), but give computable results about geometrical 
properties due to hyperbolic metric. 

Theorem 3 (Theorem of void triangles). Let x, y and z he three points of 
an hyperholie T{k, d). Let the border (the triangle) he the eyele eomposed hy three 
arbitrary geodesies xy, yz and zx. Then, if the triangle has a minimal area (it 
means that it doesn^t eontain another xyz triangle in its interior), it does not 
eontain any interior point for d > 3 and at most one point for d = 3. 
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Fig. 6. Two types of geodesics 



Proof. As the previous theorem, we suppose m > 0 and we use an upper bound 
of edges on the graph of border c = xy yz zx. On each third of the border, we 
can remark the necessary presence of two leaving arcs every k — 2 consecutive 
vertices, due to the fact that the considered path is a geodesic, and that we do 
not have another geodesic “inside” . 




Fig. 7. Impossible to have only one leaving arc onk — 2 vertices with the necessity 
of having only one geodesic between X and Y (for k = 7 here) 



So we have the following inequality : 



n P m 



(n + 2m — 2) 

(fc-2) 



> 



md 

~Y 



+ n + 




3 

2 



that we can simplify : 



{k — 6) > my 
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Hence if > 4 we can verify that we have m < 0 for any hyperbolic r{k,d). 
If (i = 3, we can verify that the case m = 1 can be obtained in some little 
triangle (even minimal for k odd) as we can see from figure 5. □ 




Fig. 8. A xyz triangle with an interior point 



Corollary 2. We note D{x\y^z) the maximum distanee from x to a point in 
the interseetion of the subgraphs of geodesies xy and xz (it is always defined 
as X must he in this interseetion) . We eonsider x' , sueh a point with distanee 
D{x\y^z) from x. If we look at the triangle x'yz, we ean deduee : 



D{x, z) + D{x, y) > D{y, z) > D{x, z) + D{x, y) — 2.D{x\y, z) — 2k 

We must note that D{x'\y,z) = 0 in that example. Hence, distance on a 
hyperbolic discrete network is working in a similar way as in trees. 

Corollary 3. Due to the eharaeterization of geodesies, we ean remark that, if 
d > 3 and k > 4, every triangle (even non minimal) ean not have any interior 
points. 



This Theorem could be extended from triangles (3-gones) to ^f-gones. The 
result would be the same for d > ^. It means that it is difficult to have a border 
with a large area. When d is great, so is the curvature 7 , and areas are delimited 
by very unsmooth large borders. 

6 Conclusion 

Now, we know the properties of a hyperbolic network : areas are small, perimeters 
very unsmooth and discrete geodesics are thin. So we can see such networks like 
a compromise between common Euclidean network, with quadratic areas, simple 
perimeter and a lots of geodesics; and infinite trees, with infinite areas and unicity 
of geodesics. 
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For network algorithms, hyperbolic networks have the qualities of Euclidean 
grids, like good connectivity but as we can see from this article, they have quali- 
ties of tree networks, like efficient broadcast (exponential number of neighbors), 
and specific routing methods (thinness of geodesics). 

For graphical purpose, the main problem is still in the representation of such 
networks. They “grow” very fast and are very difficult to show in an efficient way. 
Mathematicians use some finite nice way of representation that is completely 
useless for practical uses. The solution seems to implement some three dimen- 
sional representation using the underlying hyperbolic structure of the network. 
This could lead to a better understanding of every hyperbolic three dimensional 
surface, allowing best polygonalization of special shapes. 
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Abstract. It is known that a set consisting of the digital curves whose 
’’original curves” are graphs of continuous functions, having at most 
two intersection points, pairwise, on a given interval, can be uniquely 
coded by five parameters. This result is applied to the set of digital 
hyperbola segments, corresponding to the hyperbolas of the form y — 
^^+7 inscribed into the (m x m)-integer grid. An 0(m- (log(m+ 1/5|))^) 
algorithm for recovering the digital hyperbola segment from its proposed 
code, is presented. 

Keywords: image processing, pattern analysis, reconstruction, digital 
hyperbola. 



1 Introduction 

The most often appearing digital curves in practice are those obtained by digi- 
tizing straight lines and conic sections. For the most of them the basic problems 
considered in computer vision and image processing, such as the efficient repre- 
sentation and reconstruction, as well as the parameter estimation, are already 
solved, by using different approaches [2]- [7], [10]. The coding scheme developed 
for the sets of the digital straight line and parabola segments, based on the least 
squares technique, is successfully applied to the set of the digital hyperbola seg- 
ments of the form y = ^ -\- (3 [9]. Unfortunately, that concept, which has the 
advantage of saving information of the approximate form and position of the 
original object, could not be applied to the segment of the general hyperbola 
because, in that case, the determination of the coefficients of the least squares 
hyperbola fit leads to a nonlinear problem of great complexity. The solution of a 
coding problem for the general digital hyperbola segment is suggested in [8]. It 
is shown that the set consisting of the digital curves whose ’’original curves” are 
graphs of continuous functions, having at most two intersection points, pairwise, 
on a given interval, can be uniquely coded by five parameters. This result, spec- 
ified to the set of the digital hyperbola segments, is briefed in Section 2 and the 
optimality of the proposed code is considered. One-to-one correspondence estab- 
lished between the digital hyperbola segments and their representations enables 
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reconstruction of the digital hyperbola segment from its proposed code. The re- 
covering algorithm is described in Section 3. As an illustration, some numerical 
results are presented. In Section 4 some concluding remarks are given. 

As it is usual in the digital picture analysis, we assume through the paper 
that all the appearing points have positive coordinates. In other words, the origin 
is placed in the left-lower corner of the given picture. 



2 Preliminaries 



Consider a hyperbola h, in the Euclidean plane, given by ^ + 7. It 

is digitized by digitizing method in which the first digital points (points with 
integer coordinates) below the curve are taken. 

We will be dealing with digital hyperbola segment obtained by digitizing 
part of the hyperbola lying between lines x = x\ and x = X2 for some xi 
and X2- Without loss of generality we can assume that xi and X2 are integers. 
We also require that f 3 ^ [xi,X2]; otherwise, the problem can be divided onto 
two independent digitization processes: the first one on the interval [xi, [f 3 \] and 
the second one on the interval [[/3] , X2]. If xi = A; and X2 = k n — then the 
associated set of digital points for the segment of the hyperbola h is the digital 
hyperbola segment Hk{h,n)^ defined by 



Hk{h,n) 




i = /c,A; + l,...,/c + n 




where [t\ is the largest integer no bigger than t. Then n is the length of the digital 
hyperbola segment Hk{h,n), i.e. the number of digital points of Hk{h^n). 

So, if we denote yi = + tJ 5 ^ = /c, /c -h 1, . . . , /c + n — 1 and [3 ^ 

[/c, /c-hn — 1], then the digital hyperbola segment Hk{h^ n) of an arbitrary length, 
is coded by five parameters {k,n^a{h),b{h)^c{h))^ where: 

• A; is abscissa of the left endpoint of the digital hyperbola segment Hk{h, n); 

• n is the length of the digital hyperbola segment Hk{h, n); 

/c+n — 1 

• a{h) = Y, Vii 

i=k 
/c+n— 1 

• b{h) = E * • 2/*; 

i=k 
/c+n— 1 

• c{h) = Yl ' Vii where g{x) is an arbitrary strictly convex 

i=k 

or strictly concave function having a second derivative on [A;, A; -h n — 1]. 



From the general result proved in [8], it follows that there do not exist two 
different digital hyperbola segments with the same proposed code. The proof of 
the statement, specified to the digital segments corresponding to the hyperbolas 
in the general form, is briefly reported here, since some of the details will be 
needed in the recovering procedure analysis. As it is already mentioned, the size 
of the observed grid is denoted by m. An auxiliary statement is used: 
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Lemma 1 Let a sequence of nonnegative numbers, satisfying 

0 < ai < a2 < . . . < at < 6i < 62 < . . . < < at+i < . • • < an 

and ai + tt2 + . . . + ttn = 61 + 62 + . . . + , he given, 

and let g{x) be either a strictly convex or a strictly concave function, which has 
a second derivative on the interval [ai,an]. Then 

9 {ai) + g{a2) + • • • + g{an) + gih) + 3(62) + • • • + g{bn) ■ 

The proof of this statement is also given in [ 8 ] . 

Theorem 1 Let g{x) be a strictly convex (concave) function having a second 
derivative on the interval [ 0 , m — 1 ] and let Hr{l,p) and Hs{t, q) be digital hyper- 
bola segments, with {r,p,a{l),b{l),c{l)) and {s,q,a{t),b{t),c{t)) - codes, respec- 
tively. Then 

Hr{l^p) = Hs{t,q) is equivalent to 

{r = s A p = q A a(l) = a(t) A b{l) = b{t) A c{l) = c{t)). 

Proof From the definition of the proposed code follows that 
Hr{l^p) = Hs{t,q) implies 

{r = s A p = q A a{l) = a{t) A b{l) = b{t) A c{l) = c{t)) . 

The opposite direction will be proved by a contradiction. 

For the given digital hyperbola segment Hk(h,n), the set Dk(h,n) contain- 
ing all the digital points lying above the x-axis and below the hyperbola h, with 
abscissas belong to the interval [k,kpn — l] is observed . Then the interpretation 
of the parameters a(A), 6 (A), c{h) can be given as follows: 

/c+n — 1 

• a(A) = Vi — S I is the number of the points of Dk{h,n); 

i=k {x,y)eDk{h,n) 

/c+n — 1 

• Kh) = L i- Vi = E ^ 

i=k {x,y)^Dk{h,n) 

is the sum of the x-coordinates of the points of Dk{h, n); 

/c+n — 1 

•c{h)= y g{i)-yi= y g{x), 

i=k (x,y)^Dn(h,k) 

is the sum of the values of the function g{x) in all the points of Dk{h, n). 

From the assumption that two different digital hyperbola segments Hr{l,p) 
and Hs{t,q) have the same representation, we conclude that they are observed 
on the same interval [r,r P p — 1 ] = [5, s + <7 — 1 ], and that the relations 



Dj-if^p) 7 ^ Ds{t,q) 



(1) 
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a{l) = a{t) , h{l) = b{t) , c{l) = c(t) (2) 

are satisfied. 

From (1) and the first relation in (2) follows that 

#{Dril,p)\D,{t,q))=#iDs{t,q)\Dr{l,p))7^0 ( 3 ) 

which means that hyperbolas I and t must have an intersection point xq G 
[r, r + p — 1]. Assuming b{l) = 6(t), we have 

(4) 

{x,y)eDr{l,p)\Ds{t,q) {x,y)eDs{t,q)\Dr{l,p) 

But the sets Dr{l^p) \ Ds{t, q) and Ds{t^ q) \ Dr{l^p) can be separated by the 
straight line x = xq in such way that, let’s say, x < xq for x G Dr{l^p) \ Ds{t^ q) 
and X > xo for X G Ds{t^q) \ Dr{l^p) (2). Contradiction with (3) follows. 

From (3) and the assumption c{l) = c{t) follows that 

9{x) = 9{x) . (5) 

{x,y)eDr{l,p)\Ds{t,q) {x,y)eDs{t,q)\Dr{l,p) 

If we suppose that I and t have two intersection points xi,X2 G [r, r + p — 1] 
(xi < X2), the sets Dr{l^p) \ Ds{t^q) and Ds{t,q) \ Dr{l^p) are separated by 
straight lines, x = xi and x = X2 in such way that, let’s say, x < x\ or x > X2 
ioi X ^ Dr{l^p)\Ds{t^q) and x\ < x < X 2 ^.oi x ^ Ds{t^q)\Dr{l^p) {2). 

In other words, taking into account (3) and (4), we have that the abscissas 
of the points of Dr{l^p) \ Ds{t^q) and Ds{t^q) \ Dr{l^p)j are divided into two 
sets in such way that the conditions of the Lemma 1 are satisfied. But the result 
following from Lemma 1 is in contradiction to (5). 

Since two hyperbola arcs can have at most two intersection points on the 
observed interval, the proof is finished. □ 

Since for the appropriate choice of g{x) all of its parameters are integers, this 
code is optimal, which is proved by the next lemma. 

Lemma 2 The {k,n^a{h)^b{h)^c{h))-code requires an asymptotieally optimal 
number of bits. 

Proof For coding numbers A;, n, a(h), b{h) and c{h) (for g{x) chosen to be, for 
example, any polynomial function) Oiiogm) bits are required. Since there are 
at least m different digital hyperbola segments which can be inscribed into the 
mxm integer grid, at least log m bits are required for representing them. □ 

Remark i . .* If I : y = + 7/ and t : y = + 7t Iwo hyperbolas 

corresponding to different digital segments on the interval [r, r + p — 1], from the 
proofs of Lemma 1 (conditions assumed) and Theorem 1 it follows: 
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r X r+p-1 

0 



Fig. 1. I and t have one intersection point xo C [r, r+p— l],so Dr{l^p)\Dr{t^p) 
and Dr{t^p)\Dr{l^p) are separated hy x = xq. Assuming a{t) = a(/), b{t) > b{l) 



• If I and t have one intersection point on [r, r + p — 1] and a{l) = a{t), then 

^ m<m 

• If I and t have two intersection points on [r, r + p — 1] and a{l) = a{t) and 
b{l) = b{t) hold, then 

+ 7t ^ c{l) > c{t) for g"{x) > 0 

^ + 7/ > ^ + 7t ^ c{l) < c{t) for g"{x) < 0. 

3 Recovering Algorithm 

The one-to-one correspondence between the set of the digital hyperbola seg- 
ments and the set of their proposed representations enables reconstruction of 
the segment from its code. In this section we describe a recovering algorithm. 
We assume that (a > 0 and (3 < k. Other cases are treated in similar way. 
First, we describe a procedure Set-a{k^ n, A, 5, a, H) which, for a given in- 
terval [/c, /c -h n — 1], parameters A, B and a, finds a hyperbola H \ y = + C, 

such that a = a{H) 

Procedure SeBa{k, n. A, 5, a, H) 

Input: The numbers k,n, A, B,a 
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Fig. 2. I and t have two intersection points xi,X 2 G [r, r + p — 1], and Dr{l^p) \ 
Dr{t^p) and Dr{t,p) \ Dr{l^p) are separated hy x = x\ and x = X 2 - Assuming 
a{l) = a{t) and b{l) = 6(t), c{t) > c{l) for g"{x) > 0 and c{t) < c{l) for g"{x) < 0 



Output: Hyperbola H of the form y = + C, satisfying a{H) = a. 

Step 1: Draw a hyperbola segment htemp • V = and compute a{htemp)- 

Step 2: Translate htemp by vertical direction (upward ifa — 

a (htemp) > 0 and downward otherwise) and denote obtained hyperbola by h' . 
Compute a(h'). (In this way, |a — a(h')| < n is preserved). 

Step 3: Let (k i — l,?/^), i = be the digital points obtained by 

digitizing the hyperbola h' . Find the numbers di, z = l,2,...,n, where di is 
the vertical distance between the point (/c + z — 1,^^ + 1), z = l,2,...n and 
the hyperbola h' . 

Step 4- Find the (a — a{h'))-th. smallest number among the numbers di. (Note: 
Any algorithm that finds the r-th median of the set can be used - for example 
see [1] which is linear time algorithm.) 

Step 5: Translate h' to its final position FT by (a — a{h'))-th smallest distance, 
in the vertical direction. 

Lemma 3 The time eomplexity of the proeedure Set.a{k^ n, A, 5, a, H) is 0{m). 

We continue with the lemma which, for the given coefficient (3^ determines the 

interval I (a) containing coefficient a: 
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Lemma 4 If Hk{h, n) = {(i, i = /c, . . . /c + n — 1} is the digital hyperbola 
segment obtained by digitizing the hyperbola h : y = + 7 ^ then 



ol G 



26(/i) — a(/i)-(2/c+n— l)+n-(2/c+n— 1) 

n 5 

2z — n — 1 



2b(/i)- 



a(/i)-(2/c+n— 1)— n-(2/c+n— 1) 

n 

2z — n — 1 



Proof. According to the digitizing method, for i = 1, 2, . . . , n it follows: 

+ [2//=+i-i,2//=+i-i + 1) 

and 

? ^ ^ + (A: + i - 1) - 7 e [(/c + i - !)• {k + i - !)• (yk+i-i + !))• 

k P t — 1 — jj 



This implies the following system of inequalities: 






1 



k P i — 1 — [3 



+ nq G [a(/i), a(/i) + n) 






/c + i — 1 
/c + i — 1 — /3 



n 

+ 7^(fc + i - 1) 

2=1 



G 



6(/i), b{h) + nk+ ^^^^ . 



By solving it, the statement follows. (Note; y 7 Gi-/3 < 0. ) □ 

2 = 1 

Remark 2. From above, it is not difficult to conclude that, for the different values 
of (U, while keeping [3 and a(h) fixed, the parameter b(h) can take all the integer 
values between 1 and n(2k n — 1). 



The following lemma permits a binary searching over the coefficient o;, while 
j3 is fixed: 

Lemma 5 Let H' and H" be hyperbolas corresponding to different digital seg- 
ments, obtained by calling Set-a{k, n. A' , B, a, H') and SePa{k, n. A" , B, a, H"). 
Then A' < A" is equivalent to b{H') > b{H"). 

Proof. Let H' and H" be given, respectively, by the equations 

y = ^4^ + C' and y = i where A + AA' = A". 

From ,n) Hk{H",n) and a{H') = a{H") = a we conclude that the 

(only) intersection point xq of H' and H" is in xq G (/c, /c + n — 1), and that 
the relation {C' — C"){xq — B) = A A' holds. Since B 7 k and xq 7 k, we 
have (xo — B) > 0. According to the assumption that A' > 0 and A" > 0, the 
condition b{H') > b{H") is equivalent to C' > C" (see Remark 1. and Fig.l). 
So, we have b{H') > b{H") 77 AA' > 0 77 A' < A". □ 

There is a possibility that the hyperbola H of the form y = + C, 

satisfying a = a{H) and b = b{H), for the chosen B is not found, which means 
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that \b{H) — b\ is unreachable for any A. It could happen only if the parameter /3, 
locating the vertical asymptote of the original (digitized) curve is very close to 
(far from) k (the beginning of the segment). So, in that case, in order to increase 
(decrease) b{H), the chosen B should be increased (decreased). 

Now, we give the procedure which, for a given parameter 5, interval [k,k-\-n — l] 
and numbers a and 6, finds a hyperbola H of the form y = + C, such that 

a{H) = a and b{H) = b. (We mention again that m is the grid size). 

Procedure Set-ab{k, n, 5, a, 6, H) 

Input: The numbers k^n^B^a^b. 

Output: Hyperbola H of the form y = + C, with a{H) = a and b{H) = b. 



Choose a coefficient A G /(o); 

Step 2: SeBa{k^n^ A^B^a^H)] 

Step 2: WHILE {b{H)) ^ b and the weight of I {a) > ^) DO 

{chose the next A G I {a) by using binary searching; (* Lemma 5 *) 

Set-a{k^n^ A^B^a^H) } ; 

Step 3: IF {b{H) + b) 

(* required accuracy is reached without finding appropriate A *) 

{increase B if b > b{H) or decrease B if b < KH); 

SeBab{k,n, B^a^b^ H) }. 

As a result of executing the Procedure Set-ab{k,n^ B^a^b^ H), we have the 
segment {(i, yi), i = /c, . . . , /c + n — 1} of the hyperbola H : y = + C for 

which a{H) = a and b{H) = b. If c{H) = c, the recovering process is finished. 
If not, we need the following result which permits binary searching over both of 
the coefficients a and f3' 

Lemma 6 Let H' and H" be defined by y = + C and y = x^b'^ab') 

C" , respectively and correspond to different digital segments, obtained by calling 
Set-obfk, n, B' , a, b, H') and SeBabfk, n, B" , a, b, H ") . 

Then c{H') > c{H") ^ {A' < A" and B' > B") if g"{x) > 0 , 

and c{H') > c{H") ^ {A' > A" and B' < B") if g"{x) < 0 . 

Proof. From 

n) ^ n), a{H') = a{H") = a, b{H') = b{H") = 6, 

we conclude that H' and H" have two intersection points, x\,X 2 G {k,k-\-n — l), 
satisfying (C' - C"){x - B'f - {AB'{C' - C") + AA'fix - B') - A'AB' = 0. 
According to the Remark 1, (see Fig. 2) we have c(iL') > c[H") C' > C" , if 

g"{x)>f) on [k,kAn — l], and 

c{H') > c{H") C' < C" , if g"{x) <0 on [k,k A n — 1]. 

Both of the intersection points of H' and H" belong to (k,k A n — 1), so 



Step 1: Set I {a) = 



2b— g-(2 /c+n— l)+n-(2 fe+n— 1) 2b— a-(2 fe+n— 1) — n-(2/ c+n— 1) 






(xi -S') • G2 -S') > 0 ^ -#^>0 



(xi - B') + {X2 AB' + AA > 0 
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which implies /\B' < 0 and /\A' >0, if g'\x) > 0 

and /\B' > 0 and AHi' < 0, if g"{x) <0. □ 

Finally, here is the recovering algorithm for the digital hyperbola segment from 
its proposed (/c, n, a, 6, c)-code: 

Algorithm HYPERBOLA{k, n, a, 6, c, H) 

Input: Numbers k (the abscissa of the left endpoint of the segment), n (length 
of the segment), a, b and c. 

Output: Hyperbola H : y = + C, which satisfies a{H) = a, b{H) = b 

and c{H) = c, i.e. which is corresponded to the coded digital segment. 

Step 1: Choose the coefficient B < k ; 

Set-ob(k^n^B^a^b^H) ; 

Step 2: IF {c{H) + c) 

{ choose the next B by binary searching; (* Lemma 6 *); 
Set-ob{k^n^B,a^b^H) } . 

The procedure will be terminated when the solution (one of the hyperbolas 
corresponding to the given code) is found. If the initial lower bound for B is 
given, the algorithm can be used for solving the recognition problem as well, 
since it will be shown that the solution, if exists, should be reached before the 
width of the interval containing f3 becomes less then 

In the end, complexity of the algorithm HYPERBOLA will be discussed. 

Lemma 7 Every digital hyperbola segment ean be obtained as the digitization of 
the hyperbola of the form y = + 71 ; where a and f3 are the coeffieients 

of the hyperbola y = +7 passing through three digital points, and and sp 

are positive, small enough numbers. 

Proof Consider a hyperbola h : y = + c. 

By translating h downward until it reaches the nearest digital point, (x^, y^), 
we get hyperbola hi : y = + 7. By changing parameter b till the next 

nearest point, (xp,yp), is reached, we get hyperbola h 2 ’ y = + 7 . Finally, 

changing the parameter a and reaching the third digital point (xa^ya) leads to 
the hyperbola hs : y = ^ + 7. 

So, hs passes through three digital points and y = ^^^^^ 7^+71 is determined 
by the points {x^ E e,ya ± e), {xp ± e,yp ± e) and (x^ ± e,yj ± e), where e is 
positive, small enough number, and sign is taken for those of the points 
{xajya)j {xp,yp), (xj,yj) which are elements of the digital segment, else is 
taken. Obviously, y = + 71 and h have the same digitization. □ 

Theorem 2 Any digital hyperbola segment, eorresponding to the hyperbola of 
the form y = + 7 and presented on the m x m integer grid, represented by 

its (k,n,a,b,c)-eode, ean be reeovered in 0{m • (log(m + |/3|))^) time. 
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Proof. The coefficients a and f3 of the hyperbola y = + 7 passing through 

three given points (x 2 ,^ 2 ), {xs^ys) are determined as follows: 

^ {xi - X2){xi - xz){xz - X2){y2 - yi){y2 - yz){yz - yi) 

{{V2 - Vi){xi - X:i) - {y:i - yi){xi - X2)r 



p ^ X2{y2 - yi){xi - X3) - xz{yz - yi){xi - x^) 

{y2 - yi){xi - xs) - {y3 - yi){xi - X2) 

So, for the difference between different coefficients ai and 0 ^ 2 , and analo- 
gously, (3i and /^ 2 , of the hyperbolas y = + 71 and y = + 72 , each 

passing through three points of a grid of size m x m, we have 

I"' “""I ^ 16 ^’ 



The width of I (a) is (see Lemma 4) 



2n{2k P n — 1) 

n+l-2i 



< 4{2k n 



1)(A; + n - 1 - , 



since 



V 'v 77/ T 1 — 2i Tl 

k i — 1 — f3 ^ 2{k n — 1 — (3Y 



Taking into account that k and n are at most equal to m, we have I (a) < 
12m{2mP |/5|)^, which implies that 0{log{m + \f3\)) iterations will be needed to 
obtain I (a) less then 

For locating /3, 0{\og{m - 1 - \f3\)) iterations will be needed, too; I{f3) will be 
less then after that. With Lemma 3, the statement follows. □ 



Table 1. contains some numerical examples. 



Table 1. The digital segment Hk{h,n), obtained by digitizing h : y = +7 

on the interval [k,k P n — 1]^ is coded by {k, n, a, 6 , c)-code, where g is chosen to 
be . After applying the recovering procedure, the parameters A^B^C of the 
hyperbola hrec • V = 7 ^ + having the same corresponding digital segment 
on the observed interval, are obtained 



OL 


p 


7 


(/c, 77, a, 5, c) 


A 


B 


C 


5.12 


2.88 


4.31 


(3,7,80,350,1842) 


5.278417477 


2.875 


4.223145 


(4,42,179,4182,125968) 


4.529032519 


3 


4.393433 


(45,27,108,6264,369864) 


0.890407522 


44 


4 


17.19 


-0.53 


2.77 


(1,4,36,77,209) 


10.758620689 


0 


4 


(10,22,70,1399,30927) 


0.744398733 


9 


3.83251 


(25,67,184,10247,633817) 


0.702846342 


24 


2.98608 
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4 Comments and Conclusion 

Our research has been motivated by the fact that the conic sections are the most 
often appearing digital curves in practice of computer vision and image process- 
ing. For all of them the different solutions of the coding problem are known, and 
for most of them either a recovering procedure or the parameter estimation of 
the coded digital curve can be found in the literature. But as far as the digital 
segment corresponding to the general hyperbola is considered, only the problem 
of representation can be solved, by applying the general result developed in [ 8 ]. 
The proposed code consists of five parameters. Four of them are strictly deter- 
mined, while the fifth can be chosen according to the practical reasons, since 
it depends on an arbitrary strictly convex (concave) function g{x)^ having the 
second derivative on the observed interval. In the previous sections the recov- 
ering procedure for the digital hyperbola segments, represented by that code, 
is described. If the particular function g{x) is noted, the recovering algorithm 
can be applied for any choice of g{x). In practice, if ^ + 7 is coded on 

the interval [/c, /c -b n — 1 ], the information about the sign of a and (k — (3) is 
supposed to be included in the code, as well; the conclusion analogous to those 
of Lemma 5 and Lemma 6 , permitting binary searching, can be derived for any 
of the cases. The result is obtained in 0{m • (log(m -h |/^|))^) time. 

The main contribution of the paper is that it gives a complete and efficient 
solution to the representation and the reconstruction problem connected with 
one class of the digital images. The proposed code is simple, uniquely determined, 
recoverable and optimal, while the recovering algorithm enables the exact and 
fast reconstruction of the digital segment. For comparing, it might be noted that 
the algorithm for the exact reconstruction of the digital ellipse (circle) does not 
exist, even though the efficient coding scheme (based on the similar concept as 
proposed here) enables very efficient estimation of the parameters of the original 
object [ 10 ]. So the idea for the further research could be to find a constant time 
approximate reconstruction for the relevant parameters of the hyperbola from 
the code of its digital segment. 
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Abstract. A method is proposed which computes the visible points 
of surfaces in a 3-dimensional discrete space. The occlusion of surface 
points of an object by other object points is determined by shooting 
a discrete ray from each surface point towards the center of projection 
considering the intersection of the ray with other object points. Since 
the projection of points onto the viewing plane is done by a continuous 
mapping, additionally to the discrete ray, the location of the continuous 
projection ray is examined regarding its location to the surface points 
that are intersected by the discrete ray. 



1 Introduction 

The growing interest of computer graphics in the three-dimensional discrete 
space 7Z^ has led to new application fields of volume data: e.g. virtual reality 
in medicine [19], volume-based interactive design and sculpturing [16,11]. For 
such application fields, synthetic objects, i.e. geometrically defined objects in 
7Z^ or data sets obtained by rastering [1,15] geometric descriptions of objects 
given in IR^, are generated and the boundaries of these objects are required to 
be visualised. 

Three principal approaches for rendering discrete objects can be distin- 
guished: backward or image- order projection^ forward or object- order projection 
techniques, and hybrid techniques [14,10,9] that combine advantages of both the 
backward and forward projection methods. Backward projection algorithms tra- 
verse the pixels and solve the visibility problem for each pixel by casting a ray 
from the viewing point through each pixel of the image into the data space. This 
group includes ray- casting [5,6,3] and ray -tracing [18] techniques, which have 

* This work has been supported by a postdoctoral grant of the DFG (Deutsche 
Forschungsgemeinschaft) . 
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been specifically developed for the rendering of volumes. Forward projection al- 
gorithms, e.g. splatting [17,4], traverse the three-dimensional discrete scene and 
project its components onto the viewing plane. 

In contrast to splatting techniques, where a point of is represented by a 
kernel function in the viewing plane, we assume that a point of is mapped to 
the viewing plane such that its image is a point in IR^. Consequently, generally 
the image of a discrete surface in the viewing plane is a set of scattered of 
points. Moreover, assuming a continuous viewing plane requires a method for the 
determination of visible points, which provides results independently from the 
resolution of the hnal image. Therefore, rendering methods applying the z-buffer 
algorithm are not suitable to solve the visibility problem. Instead, a technique 
is needed which determines the visible points of a scene in object space, i.e. in 
A similar problem has to be solved in discrete ray-tracing [2] when it has 
to be determined if a point is in shadow, i.e. if the point cannot be seen from 
the light source. After the visible points are mapped onto the viewing plane, the 
final image could be obtained by applying a technique, e.g., as proposed in [12]. 

In this paper, a method is proposed which determines the visible points of 
closed surfaces in . Each surface forms the boundary of a discrete object. In 
a first step, front facing points are computed in the common way, comparing the 
normal vectors at the surface points with the viewing direction. The computation 
of occluded front facing points is performed utilising the idea of forward ray- 
casting: a discrete ray is shot from each surface point towards the centre of 
projection considering the intersection with other non-empty points, i.e. points 
which belong to objects of the scene. Since the actual projection of surface points 
onto the viewing plane is done by a continuous mapping, additionally to the 
discrete ray, the location of the continuous projection ray is examined regarding 
its location to the surface points that are intersected by the discrete ray. 

The paper is organised as follows: Section 2 states the definitions used 
throughout the paper. In Sect. 3, at first the problem is examined in detail. 
Afterwards, a solution is proposed in 2D which is then extended to 3D. Experi- 
mental results are presented and discussed in Sect. 4. Einally, Sect. 5 summarises 
the paper. 

2 Definitions 

The three-dimensional discrete space is constituted by that is the 3D 

array of points with integer coordinates in the Cartesian coordinate system. 
Two points p{xp,yp, Zp) and q{xq,yq, Zq) of where (| Xp — Xq \< 1) A 
(I Hp~yq 1^ ^v~^q 1^ 1), are said to be 6-adjacent \i\ Xp — Xq\ \ yp — yq\ 

^ \ Zp — Zq\= 18-adjacent if 0 <\ Xp — Xq \ -\- \ yp — yq \ -\- \ Zp — Zq \< 2^ and 
26-adjacent iif)<\xp — Xq\^\yp — yq\^\zp — Zq\<‘^. Points /c-adjacent to p, 
where k G {6, 18, 26}, are called k-neighbours of p. A k-component of a set A is a 
maximal subset of A in which between each pair of points p, q exists a sequence 
of distinct points P = {p = po,Pi,---,Pn = q} of A whereby two consecutive 
points Pi and p^+i with 0 < i < n along the sequence are /c-adjacent. 
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The surface S C to be rendered is assumed as a closed surface that forms 
the boundary of a finite object O C whereby O is a 6-connected component. 
The points of O are called object points. The surface S C O is constituted of all 
points of O that are 6-adjacent to some point of O = — O. Each point of S is 

also required to be 26-adjacent to some point of O — S. Note that S may consist 
of more than one component. The points of S are denoted as surface points. In 
contrast, the points in O — *5^ are called inside points and the points of O are 
named empty points. 

3 Determination of Visible Surface Points 

In general, a point on a closed surface is visible only if it is front facing, i.e. 
the surface in this point is oriented towards the observer, and the point is not 
occluded by another point. 

The computation of front facing points is rather trivial and can be done 
as described in Sect. 3.1. In contrast, the determination of occluded points in 
discrete space needs further investigations. Section 3.2 describes the basic dif- 
ficulties of the solution of the occlusion problem, which arise while employing 
the idea of discrete forward ray casting. In Sect. 3.3, an approach is introduced 
to determine occluded points in 2D space. Then, in Sect. 3.4 this approach is 
extended to 3D. 

3.1 Front Facing Surface Points 

The culling of back facing surface points in discrete space is done in the same 
way as for polygons: the cosine of the angle S between the normal vector at a 
surface point and the vector representing the viewing direction is determined. If 
cos (6) < 0 the point is back facing, otherwise it is front facing. 

Clearly, a normal vector associated with each point of the discrete surface is 
essential to cull back facing points. The normals are required to represent the 
local surface geometry. The determination of normal vectors at discrete surfaces 
is an active field of research. For example, the method introduced in [13] provides 
results that are suitable for the purpose of this work. However, to deal with the 
problem of visibility independently of the computation of the normal vectors, 
one could also associate the discrete surface points with the true normals of the 
underlying continuous surface. 



3.2 Forward Ray-Casting 

To determine the surface points that are not occluded by other points, the idea 
of forward ray-casting is utilised: shoot a discrete ray, denoted as discrete view- 
ing ray^ from each front facing surface point along the viewing direction and 
check if the ray intersects any other object point. A discrete viewing ray is the 
rasterization of the straight line segment between a surface point and the centre 
of projection. 
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The basic problems, which arise for the discrete viewing rays and are matter 
of this section, address the features of a ray in order to define a point as occluded 
and the connectivity of the rays. There are two basic approaches possible to solve 
the first problem: a point is not occluded only if its discrete viewing ray does 
not intersect any other object point, or the ray is allowed to be tangent to the 
surface, i.e. the ray may intersect other surface points but no inside point. These 
two approaches and their related problems are discussed below. 

Viewing Rays do not Intersect any Object Point. The following assump- 
tion would be the simplest approach when solving the visibility problem in 7Z^ 
by forward ray-casting: a point is not occluded if its discrete viewing ray does not 
intersect any object point. However, this assumption is too strict. Frequently, 
the viewing ray of a surface point close to the contour of a surface is almost 
tangent to the surface, i.e. the ray intersects surface points but no inside point. 
Examples for this fact are shown in Fig. 1(a) and (b) for orthographic and per- 
spective projection, respectively. For simplification, these examples are given in 
2D. Clearly, the viewing rays shown in the Fig. 1 intersect other surface points. 
The condition of non-occlusion, as stated above, would lead to a missing of such 
points. A similar problem has been reported by Delfosse et al. [2] for shadow rays 
in discrete ray-tracing. They solve the problem by considering the real bound- 
ary of the underlying continuous object. This approach is not suitable for our 
work since we consider only the discrete objects, without any assumption on the 
underlying continuous objects. Consequently, we need a further specification of 
the condition for non-occluded points to allow viewing rays to be tangent to the 
surface, so that the discrete viewing ray of a visible point may intersect other 
surface points. 




□ inside point 

□ surface point 

H point of the viewing ray 



Fig. 1. Discrete viewing rays intersecting a surface point for (a) orthographic 
projection and (b) perspective projections 



Connectivity of Viewing Rays. If there is no further restriction on the 
topology of the surface, i.e. *5^ is defined as in Sect. 2, the discrete viewing rays 
have to be 6-connected to avoid a traversal of the object by the rays while 
intersecting only surface points. In such cases, the viewing rays are not tangent 
to the surface. An example in 2D is shown in Fig. 2(a). For the points p and q, 
a 26-connected viewing ray would pass through the object, hitting only surface 
points. 
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(a) 



P 




viewing plane 



□ 

□ 



inside point 
surface point 



(b) 



Fig. 2. 26-connected viewing rays and (a) a surface with no restriction and (b) 
a surface satisfying the definition from discrete topology 



The generation of 26-connected rays is twice as fast than that of 6-connected 
rays since a 6-connected ray consists of about twice as many points as the 26- 
connected ray. For optimisation purposes, a technique proposed in [18] can be 
applied to speed up the usage of 6-connected rays: to traverse the empty space 
between the objects of a scene quickly, the rays are 26-connected until they come 
close to an object. Then the connectivity of the rays changes to 6-connectivity. 
Nevertheless, 26-connected rays would be preferred for practical applications 
because of performance reasons. 

Assume S satisfies a surface definition as known from discrete topology [8,7]. 
Then S can be viewed as the discrete analog of a closed two-manifold surface, 
and S is the minimal set which separates — S into two non-empty 6-connected 
sets. Thus, each point of S is 6-connected to some point of O — S and to some 
point of O. Experiments have shown, that 26-connected viewing rays are suffi- 
cient if S has these properties. Fig. 2(b) shows an example for this case in 2D. It 
is beyond the scope of this paper to proof this fact. However, this will be a mat- 
ter of future work. In Sect. 4, results are shown for both the cases 6-connected 
and 26-connected viewing rays. 



Viewing Rays may Intersect other Surface Points. Allowing the discrete 
viewing rays to intersect surface points but no inside point may cause problems. 
It cannot be ensured that the projection of the visible points leads to a correct 
image. The problem is illustrated in Fig. 3. Consider a convex discrete object O, 
i.e. O is the discrete representation of a convex continuous object O, located in 
the viewing space with coordinate axes x and y. The image of O, denoted O', 
is the projection of O onto the x-axis. If viewing rays of visible points may hit 
any surface point, the point p G O would be visible. Since p bounds the set of 
visible points of O shown in light grey in Fig. 3(a), one would expect that the 
image p' of p bounds O', i.e. it should be the point of O' with the smallest x- 
value in Fig. 3(a). This case is illustrated for the continuous object O in Fig. 3(b). 
However for the discrete object in Fig. 3(a), after the projection of the visible 
points the image q' of the visible point q forms the boundary of O' instead. 
This problem arises because discrete rays are applied to determine the visible 
points in and, in contrast, a continuous mapping is used for the projection 
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of the visible points onto the viewing plane. Therefore, a criterion is needed 
which ensures that points like p are excluded from the set of visible points while 
allowing viewing rays to intersect other surface points. 




Fig. 3. Problem arising after mapping visible points onto the viewing plane: (a) 
for the discrete case, the projection of the point p does not form an outline of 
the image as it does in (b) the corresponding continuous case 



3.3 Visibility in 2D 

To solve the problem described in the previous section, a condition is introduced 
to exclude points from the set of visible surface points whose projection would 
lead to artefacts in the rendered image. This problem may arise only for surface 
points whose discrete viewing rays intersect other surface points. 

At first, the problem is examined in 2D. Consider a front facing surface 
point p associated with a discrete viewing ray r^, which intersects only empty 
points or other surface points. In case Vp intersects any inside point, p is not 
visible. The point p is not occluded if for each surface point q along the 
continuous projection ray Vp of p does not intersect O. Note that Vp is the 
continuous representation of Vp. If fp is referred to as a vector subsequently then 
it is considered as the normalised vector representing the projection ray of p 
pointing towards the centre of projection. 

Assume a continuous surface patch Sq of differential size with a normal vec- 
tor Nq located at a surface point q ^ Vp and check the location of with respect 
to Sq. The vector Nq is assumed to be oriented towards O. Thus, it gives us some 
notion on which side of Sq the inside of the object O is located. Consequently, 
it can be determined if Vp crosses the inside of O in the neighbourhood of q. 

The formal realization of this test is described in the following and is illus- 
trated in Fig. 4. The straight line defined by fp separates the 2D space into two 
half-planes. This is illustrated in Fig. 4(a). By translating the normal vector Nq 
into p, it can be checked if fp intersects O in the neighbourhood of q depending 
on which half-plane p + Vg belongs to and where q is located. lip^Nq and q are 
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in different half-planes, p is not occluded by q. This case is shown Fig. 4(b). The 
line through q represents the surface patch Sq in q. The projection ray of p goes 
along the outside of the surface in the neighbourhood of q. This is additionally 
illustrated by the projections p' , q' and iV' of p, q and Nq^ respectively, onto the 
viewing plane, li p^ Nq belongs to the same half-plane containing q or q ^ p 
is not visible. In the first case, the projection ray of p goes along the inside of 
the object in the neighbourhood of q as illustrated in Fig. 4(c). In the latter 
case, the projections of p and q are identical, but q is closer to the observer and, 
therefore, it occludes p. 




Fig. 4. (a) Subdivision of the space by Vp into half-planes, (b) and (c) location 
of q in the half-planes with respect to the normal vector Nq of q 



The two cases discussed above do not cover a normal vector Nq having the 
same direction like f^, i.e. fp = Nq or fp = —Nq. If the normal vectors associated 
with the discrete surface points reflect the local surface configurations correctly, 
one can expect for fp = —Nq^ that Vp hits also some inside point of the object 
since it crosses S with respect to Nq from the outside to the inside. If fp = Nq, Vp 
would pass through S in q from the inside to the outside and should also hit 
some inside point. In fact, these cases could not appear for a discrete viewing 
ray which is tangent to a closed surface. If they arise nevertheless in a practical 
application, we assume for these cases that q occludes p as presented above. 



3.4 Visibility in 3D 

For the extension of the approach described in the previous section to 3D, an 
additional dimension for the location of the local surface in q with respect to fp 
has to be taken into account. 

Consider the surface patch Sq in q and its projection onto the viewing plane. 
Without loss of generality, assume the viewing plane is located in q. (More 
generally, the viewing frustum is assumed to be bounded by a front clipping 
plane located in q and parallel to the projection plane.) Unless the direction 
of Nq or —Nq is equal to the projection ray fq of q, the intersection of the 
projection plane and Sq leads to a curve 7 ^ with a normal vector V' that is 
the projection of Nq. This is illustrated in Fig. 5(a). Note, that for fp = Nq 
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and Tp = —Nq^ we make the same assumption as in 2D, i.e. p is occluded by q. 
For orthographic projection fp = fq. For perspective projection, the difference 
between Vp and fq are very small since q is intersected by Vp. Thus, we neglect 
this difference and assume in the following additionally if fq = Nq or fq = —Nq 
that q occludes p. 

To determine if fp intersects the surface in q^ we translate the tangent line 
at 7 g to p' . This line separates the viewing plane into half-planes and enables a 
similar approach as described for 2D to determine if q occludes p\ if p' -fN^ and q' 
are in the same half-plane, q occludes p. If they are in different half-planes, q 
does not occlude p. The distance d shown in Fig. 5(a) between p' and q' depends 
on the location of p and q in 3D and is limited since p and q belong to the same 
discrete viewing ray, which in turn is the rasterization of the projection ray. 
Consider the straight line that contains p and is perpendicular to the tangent 
line at 7 ^. If d is neglected, we obtain on this line the same scenario as illustrated 
on the viewing plane of Fig. 4(c) and (d). 




viewing plane 



viewing plane 



(a) (b) 

Fig. 5. Computation of the plane T 

For practical applications, we perform the test described above directly in 3D. 
For this, a plane is needed in 3D that separates the space into half-spaces. Clearly 
the plane, denoted with T in the following, must contain fp and the projection T' 
of T must be perpendicular to N'^. Then T can be computed as follows: determine 
a plane T containing the vectors fp and Nq^ i.e. T : fpX Nq. Then T is the plane 
perpendicular to T containing fp. More formally: (T T T) A (f^ G T). This is 
illustrated in Fig. 5(b). The point q occludes p, if q and p -f Nq are in the same 
half-space or G T. If they are in different half-spaces, p is not occluded by q. 
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4 Experimental Results 

Experimental results are shown below for the orthographic as well as the perspec- 
tive projection. The normal vectors for the examples were computed employing 
the method introduced in [13]. The images in Fig. 6 show the results for two 
spheres using 26-connected viewing rays. The spheres were rastered in an array 
of 75^ points. The occlusion was checked only for front facing points. The re- 
sults shown in Fig. 6 were obtained by applying our method (upper row), by 
not allowing the discrete viewing rays to hit any object point (middle row) and, 
finally by allowing the discrete viewing rays to hit any surface point without any 
further restriction (lower row). Clearly for the second case, many points particu- 
larly along the outline of the objects are missed. This leads to a gap between the 
two spheres. In the last case, consider the part of the outline of the sphere in the 
foreground which occludes the other sphere: especially for perspective projection 
in the example, points of the occluded sphere were projected “into” the image 
of the occluding sphere. In contrast, this problem does not arise for our method. 
The lines near the outlines of the images in the upper and lower row of Fig. 6 
are visible points which are very close to each other in the viewing plane such 
that these points are represented in the final image by neighbouring pixels. If 
more than one visible point would be represented by a pixel in the image, the 
point closest to the observer is viewed. 

Figure 7 shows the necessity of the application of 6-connected discrete viewing 
rays, depending on the properties of the surface. The two cuboids were rastered 
in an array of 50^ points. The problem arising for 26-connected rays along the left 
edge of the cuboid in the foreground corresponds with the situation illustrated 
in Fig. 2(a) by the point p\ viewing rays of points of the occluded cuboid pass 
through the surface so that they are projected “into” the image of the occluding 
cuboid. This can be avoided by using 6-connected viewing rays instead. 

5 Summary 

The determination of the visibility of points is a major task of rendering. In this 
paper we have proposed a technique which determines the visibility of points 
on a surface in 7Z^ ^ which is defined as the boundary of an object. The method 
introduced works in object space and employs the basic idea of discrete forward 
ray-casting. The visibility of each point is computed in 7Z^ considering a con- 
tinuous mapping for the projection of the points onto the viewing plane. This 
assumption makes the results independent of the resolution of the final image 
and enables an application of the method, e.g., for discrete ray-tracing to solve 
the shadow problem. We have shown by examples, that our method is suitable to 
solve the visibility problem and thus can be employed for rendering techniques 
such as those proposed in [12]. 

A remaining problem of this work, that is matter of future work, is a proof 
that 26-connected rays are sufficient if the surfaces to be rendered satisfy certain 
conditions. This would lead to a performance increase by using 26-connected rays 
instead of 6-connected rays for our method. 
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Fig. 6. Visible points of two spheres determined for orthographic (left column) 
and perspective projection (right column) by allowing discrete viewing rays to 
hit other surface points only if the projection rays do not intersect the object 
(upper row), not allowing intersections of any object point (middle row), and 
allowing intersections of any surface point (lower row) by the discrete viewing 
ray 
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Fig. 7. Visible points obtained from 26-connected discrete viewing rays (left) 
and 6-connected viewing rays (right), and an enlargement of a detail (lower row) 
showing the upper left corner of the cuboid in the foreground 
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Abstract. The aim of this paper is to describe a conceptual model for 
surface representation based on topological coding, which defines a sketch 
of a surface usable for classification or compression purposes. Theoretical 
approaches based on differential topology and geometry have been used 
for surface coding, for example Morse theory and Reeb graphs. To use 
these approaches in discrete geometry, it is necessary to adapt concepts 
developed for smooth manifolds to discrete surface models, as for ex- 
ample piecewise linear approximations. A typical problem is represented 
by degenerate critical points, that is non-isolated critical points such as 
plateaux and flat areas of the surface. Methods proposed in literature 
either do not consider the problem or propose local adjustments of the 
surface, which solve the theoretical problem but may lead to a wrong 
interpretation of the shape by introducing artefacts, which do not cor- 
respond to any shape feature. In this paper, an Extended Reeb Graph 
representation (ERG) is proposed, which can handle degenerate critical 
points, and an algorithm is presented for its construction. 



1 Introduction 

Reasoning about shape is a common way of describing real objects in engineer- 
ing, architecture, medicine, biology, physics and in daily life. So far, research 
in modelling shapes has mainly focused on geometry, with the aim of defining 
effective representations and accurate approximations of objects [1]. To describe 
geometric objects, however, different levels of mental models can be used. It 
is possible to use natural-language terms to qualitatively describe the external 
shape of an object, or to draw a sketch of it, or to describe it by listing its dif- 
ferences with respect to some other similar objects, or also to define it according 
to what it is used for, and so on. 

Shape recognition and classification are therefore basic steps for construct- 
ing descriptions of objects. These abstraction processes have been effectively 
described for example for biological taxonomy, where shape descriptions have 
been constructed by finding a good representation and then discarding irrele- 
vant details [2,3]. The important point here is that reasoning about shape is an 
efficient approach not only for building high-level modelling environments, but 
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also for devising top-down simplification methods based on the comprehension 
of the object shape [1]. 

Differential topology and geometry have been often used for surface cod- 
ing, [4,5,6], and building shape descriptions which nicely correspond to intuitive 
mechanisms of shape cognition and recognition. 

For example an interesting method for coding the critical points of smooth 
Morse functions of two variables has been studied by Nackman [4]. Pfaltz pro- 
posed a similar approach for discrete surfaces defining the so-called surface net- 
works [6]. Bajaj et al. [7] propose two different approaches, one bottom-up the 
other top-down, to preserve the topology during the simplification of discrete 
data. Their method, however, does not guarantee an exhaustive description of 
characteristics of the surface. Moreover, it does not allow to extract global prop- 
erties of the surface and to detect degenerate configurations. 

Major research on the use of topology for surface description has been pro- 
posed by Kunii, Shinagawa et al. in [5,8] where a surface coding based on Morse 
theory and Reeb graphs has been defined (see next section). Takahashi et al. [9], 
use an approach based on surface- networks to reconstruct the Reeb graph of 
a 2.bD surface. As the previous references also this approach does not consider 
degenerate configurations which are typical of discrete surfaces, such as plateaux 
or fiat areas. 

In this paper, the Extended Reeb Graph (ERG) representation is proposed, 
which can handle degenerate critical points, and an algorithm is presented for 
constructing the ERG of bi- variate surfaces (scalar fields). The proposed exten- 
sion does not distort the semantic meaning of the Reeb graph and faithfully 
represent the surface morphology. 

The reminder of this paper is organised as follows: first, basic results of Morse 
and Reeb graph theory are given to introduce the proposed method; then our 
extension from the critical points concept to the critical areas is presented for 
discrete surfaces; finally, the proposed ERG representation and an algorithm for 
its construction from a set of contour levels are described. Some examples of its 
application to real surfaces are shown. 

2 Theoretical Background 

Morse theory originates from the calculus of variations and it allows describing 
differentiable manifolds using a limited number of information, for example by 
coding the topological relationships between critical points [10,11]. Using Morse 
theory, it is possible to construct topological spaces equivalent to a given smooth 
surface, which describe the surface as a decomposition into primitive topological 
cells [12]. Some basic results of Morse theory are given. 

Definition 1. (Morse function) Let /, / : M ^ IR, be a real smooth function 
defined on the smooth manifold M ; f is called Morse function if all of its crit- 
ical points are non- degenerate. A critical point is non- degenerate if the Hessian 
matrix H of f is non-singular at that point. 
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Non- degenerate critical points are isolated, therefore scalar fields having 
plateaux or volcano rims do not comply with the definition of Morse function. 

An example of a simple Morse function is given in figure 1(a). The figure 1(b) 
depicts an example of a complex Morse function, i.e. a function having more than 
one critical point at the same level, while figure 1(c) does not comply with the 
Morse’s function definition. 




Fig. 1. An example of height function with an isolated maximum, (a), a non 
simple height function, (b), and a circle of non isolated maxima, (c). In (b) the 
saddle points as well as the maximum points have indeed the same elevation 



The height function can be effectively used to study the surface shape. In- 
tuitively, the height function of a smooth manifold M, embedded into the usual 
three-dimensional Euclidean space, is the real function which associates to each 
point on the surface its elevation. 

Therefore, the level sets of a height function associated to a surface are 
the intersections of the surface with planes orthogonal to a fixed directions, 
which may be shaped in arbitrarily complex ways. If the height function is 
Morse, then the contour configuration can be quite simply classified. Moreover, 
smooth surfaces satisfy the Euler formula which states that the number of non- 
degenerate maximum (M), saddle (p) and minimum (m) points satisfies the 
relation M — p m = 2{1 — g) where g represents the genus of the surface. 

Reeb defined a graph to code the evolution and the arrangement of level 
curves [5,13]. The Reeb graph of a function is defined as follows: 

Definition 2. (Reeb graph) Let f : M ^ JR be a real valued funetion on a 
eompaet manifold M . The Reeb graph of M wrt f is the quotient spaee of MxJR 
defined by the equivalenee relation given by 

(XI, /(XI)) ~ (X2, /(X2)) /(XI) = /(X2) 

and XI and X2 are in the same eonneeted eomponent of f~^{f (XI)) 

A Reeb graph of a compact manifold collapses into one element all points 
having the same value under a real function and being in the same connected 
component. Moreover, theorem 1 allows us to define a further equivalence rela- 
tion among elements of the Reeb’s quotient space. 
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Theorem 1. Let f be a real smooth funetion on a eompaet smooth surfaee M, 
and assume that the segment [a, 6] (where a < b) eontains no eritieal value of f. 
Then: 

— the level set f~^{a) and f~^{b) are diffeomorfie and they eonsist of the same 
number of smooth cireles diffeomorfie to the standard cirele; 

— let Ma be the subset of M defined Ma = {x : f{x) < a}, and Mi) defined 
aeeordingly. Then Ma and M\) are diffeomorfie as two-manifolds with bound- 
ary. 

In figure 2(a) the points drawn on the manifold represent the equivalence classes: 
here the manifold considered is a bi-torus and the function defined over it is 
the height function. In figure 2(b) the Reeb’s quotient space is represented as 
a “traditional” graph where the equivalence classes are grouped into arcs if 
they are representative of diffeomorhc contours, as stated in the theorem 1. 
Obviously, since the choice of the mapping function is not unique, a manifold 
admits different Reeb graphs. 




Fig. 2. A bi-torus, its contour lines and the equivalence classes defined by Reeb’s 
quotient space (a), a possible representation of the Reeb graph wrt the func- 
tion /, (b) 



A Reeb graph describes a manifold surface by considering the evolution of 
the surface sections under a given real valued function. If we have to describe the 
shape of a manifold surface M embedded into the Euclidean space, moreover, the 
Reeb graph of M under its “natural” height function codes the shape description 
of M in terms of critical points of h, corresponding to meaningful semantic labels, 
such as peaks, pits or passes. Moreover, under the assumption that the height 
function is Morse, the structure of the Reeb graph is rather simple. A Reeb graph 
of M under h can be defined as RGh{M) = {Ph{M)^ Ah{M)) where the node 
set is defined by Ph{M) = {Pi G M, Pi is a critical point of h{M)} and the arc 
set is given by Ah{M) = {(P^,Pj), Pi and Pj G Ph{M) and the topologieal type 
of M does not ehange between Pi and Pj } (see Theorem 1). 
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If h{M) is Morse, moreover, the arcs of RGh{M) can be oriented and the 
nodes have almost degree three. 

3 Extended Reeb Graph Representation 

Reeb graph theory provides a powerful abstraction mechanism to describe the 
shape of smooth surfaces. Even if there are no restrictions in the Reeb graph 
definition on the type of /, several authors have in practice limited the use of 
the Reeb graph to Morse mapping functions (see section 1). When dealing with 
discrete surface representations, such as triangular meshes, problems may arise 
due to the loss of properties of the resulting surface which can be assumed to be at 
most continuous. The straightforward application of the Reeb graph definition to 
a generic polyhedral surface (e.g. a 3D triangulated manifold) requires at first the 
definition and extraction of the critical points. The possible definitions of discrete 
critical points, however, generally suffer of instability since small perturbations 
of the vertex coordinates may result in rather different configurations [9,7]. 

This problem is particularly crucial for surfaces defined by measurements of 
some natural or physical objects, as in digital terrain modelling or reverse en- 
gineering, where the position of the mesh vertices may have some uncertainty 
associated. Therefore, we have adopted a different approach based on the com- 
putation of a ’’sufficiently” dense number of contour lines and the definition of 
the Reeb graph from the contour set. The evolution of contour lines is less sen- 
sible to local perturbation of vertex position and, globally, provides a faithful 
description of the surface shape. The same approach has been adopted by Ku- 
nii, Shinagawa et ah, [5,8], for smooth surfaces for which the height function is 
Morse, while here generic continuous surfaces are considered for which a set of 
contour levels is given (or computed). 

An extension of the Reeb graph representation (ERG) is introduced, which 
can handle degenerate and non simple critical points, and an algorithm is pre- 
sented for constructing the ERG automatically from the set of contour levels of a 
discrete surface. The proposed extension does not distort the semantic meaning 
of the Reeb graph and faithfully represents the surface morphology. 

The innovation of this method is both the way of constructing the graph and 
the efficiency in dealing with degenerate situations. The proposed approach is 
actually not an extension of the Reeb graph itself, but rather a full application of 
its definition in the discrete domain, which does not require the height function 
to be Morse. The quotient space defined by the Reeb equivalence relation can 
be represented in terms of a graph representation, which formally represents 
the contour containment and the adjacency relationships. In this way, with an 
extended definition of critical areas, the application domain can be enlarged to 
generic continuous scalar fields, without any artifacts [14]. 

3.1 Definition of Critical Areas and Infiuence Zones 

First of all, to get a more general and unambiguous shape description, we 
extend the idea of critical points to critical areas as well. With respect to other 
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definitions, the idea of critical areas provides a more general approach as it 
takes into consideration the behaviour of a surface in a neighbourhood rather 
than locally around a point. 

Formally, let M be a continuous surface defined by a scalar field f : D C 
IR^ ^ IR, that is, M = {P = (x, y^z)\z = /(x, y)} and let h be the height func- 
tion naturally defined over M, h{P) : M ^ IR such that h{P) = h((x, /(x, y))) 

= f{x,y) and let Pi be the critical points of h{M). Under the assumption that M 
is Morse, f~^{Pi) is an isolated critical point of the surface, while if we do not 
require M is Morse, f~^{Pi) generally represents a critical area of the surface, a 
level region of M. Let C{M) be the set of contour levels of M; we assume that 
C{M) is a set of unorganised polygonal contours, which are either simple closed 
polygons or polygonal lines with the end points on the boundary of the surface. 

Let Ct{M) be the Delaunay triangulation of C{M) constrained to all con- 
tours lines. The critical points of h[M) can be efficiently detected by classifying 
the flat regions of Ct{M), that is, the connected regions of Ct{M) composed 
by triangles having all the three vertices at the same elevation [15,16,17]. 

The flat regions of Ct{M) correspond either to simply or to multiply-con- 
nected areas. Let Br{M) be the boundary of a critical area R of Ct{M)^ in 
general Br{M) = Bi U B 2 U . . . U Bn where Bi represents the i — th con- 
nected component of the boundary. According to the definition of critical area, 
each Bi may be either a contour in C{M), having elevation h{Bi) or may be a 
sequence of edges of as for saddle areas. Simply-connected critical ar- 

eas, for which Br{M) = B\^ correspond either to isolated or degenerate critical 
points of M. A multiply- connected critical area divides the surface into two 
parts: an '^outeP part which is defined by the portion of the surface outside 
the boundary of the multiply-connected area, and as many ^HnneP parts as the 
multiplicity of the critical area boundary. Let B\ be the outer boundary compo- 
nent and Bi, with i > 1, the inner ones. The following classification scheme is 
adopted: 

— Br{M) is a simple maximum area (resp. simple minimum area) iff n = 
1 and the outgoing direction from B\ is descendent (resp. ascending), see 
figure 3(a); 

— Br{M) is a complex maximum area (resp. complex minimum area) iff n > 1 
and the outgoing directions from each Bi are descendent (resp. ascending), 
(see figure 3(b)); 

— Br{M) is a simple saddle area iff n = 1 and there are at least four outgoing 
directions alternatively descendent and ascendent; 

— Br{M) is a complex saddle area iff n > 2 and the outgoing directions are 
descendent for some Bi and ascending for some other Bj or the outgoing 
direction for B\ are either descendent and ascending. 

When considering surfaces defined by scalar fields or more generally manifold 
surfaces with boundary it is necessary to give a unique interpretation to critical 
points occurring along the surface boundary by considering a global virtual min- 
imum point as a virtual closing of the surface with descending directions from 
the boundary to a global minimum. 



Extended Reeb Graphs for Surface Understanding and Description 191 



Obviously, the flat regions of Ct{M) do not always correspond to critical 
points of the height function: for example, ridges and ravines of M will cause 
level regions to appear as well, with a step-like effect which is a very well-known 
problem in digital terrain modelling. These areas, however, can be easily de- 
tected considering the number of the ascending or descending directions of the 
region boundary, as explained in details in [14,17]. Therefore, let us assume from 
now that the critical areas only correspond to critical points of h{M), either 
isolated or degenerate. Generally for polyedral surfaces the Eulero formula given 
in section 2 is not verified. Nevertheless considering the above defined critical 
areas for Ct{M) the Eulero formula is still satisfied if they correspond to non- 
degenerate critical points . 




Fig. 3. Critical areas of M correspond to flat regions in Ct{M): areas of maxi- 
mum are depicted in dark and saddle areas in light grey 



It is important to show the link between critical areas and Reeb graph nodes. 
By applying the definition of Reeb graph (cf. definition 2) all points belonging to 
a simply-connected critical area are Reeb- equivalent and may therefore collapse 
into the same node. If the isolated critical points of M were known, moreover, a 
simple labelling of the graph’s node set would be sufficient to distinguish them 
from degenerate critical points. Similarly, the behaviour of arcs incident to simple 
critical areas is equivalent to the behaviour of arcs incident to isolated critical 
points. Therefore, simple critical areas can be represented in the ERG by simple 
nodes as in the normal Reeb graph representation. 

Multiply-connected critical areas correspond to macro-nodes: that is partic- 
ular nodes having at least one arc connected to an inner node. 

With respect to the Reeb graph arcs we have introduced the concepts of in- 
fluence zones which are defined using adjacency among contours: two contours 
are adjacent if they are edge- adjacent in the constrained triangulation. To con- 
nect critical areas in the Reeb graph, an influence zone is associated to critical 
areas, which intuitively spans the surface between adjacent Reeb graph nodes. 

First of all, if is a simple saddle critical area, its influence zone is defined 
as the surface region delimited by the contours which are directly edge- adjacent 
to i?, in other words contours which are at a distance 1 from the saddle area. 

Then, if is a maximum or minimum critical area, the influence zone of R 
is the portion of the surface identified by growing the outmost component 
of the region boundary, until the boundary of another critical area is reached. 
Therefore, if the critical area is a simple maximum its influence area is the 
maximal region in Ct{M) containing R and non containing other critical areas. 
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Similarly, the influence zone of a multiply-connected area does not contain any 
other critical areas except those located in its internal boundary components. In 
practice the influence zone definition for minimum and maximum areas strictly 
relates with the theorem 1: in fact the expansion process allows to connect level 
sets which are diffeomorhc. For saddle areas, indeed, the influence zone represents 
the portion of the surface where the topological change arises. In figure 4, some 
critical areas are shown with the corresponding influence zones. The dark colour 
identifies maximum (a) and saddle areas (c), their corresponding influence zones 
are depicted in medium grey colour, (b) and (d). 







Fig. 4. Critical areas, (a), (c) and their influence zones (b), (d) 



3.2 Construction of the ERG Representation 

In this paragraph a short description of the algorithm to extract the critical 
areas is given. First of all it is important to notice that each contour Ct{M) 
is edge- adjacent to the previous and the next one, in the height sequence of 
C{M); therefore the ascending/descending directions between contours can be 
simply checked. Then, critical points of h(M) can be efficiently detected by 
classifying the level regions of Ct{M)^ that is, the connected regions of Ct{M) 
composed by triangles having all the vertices at the same elevation [15,16,17]. 
The classification of flat areas as critical areas, simple or complex, is done by 
checking the number of non-constrained edges in the boundary: 

— simple saddle areas are detected in Ct{M) as flat areas with more than one 
edge in the boundary, which does not belong to the constraints, that is to 
the contours; 

— maximum or minimum areas, either simple or complex, and complex saddles 
are detected in Ct{M) as flat areas whose boundary is fully composed by 
constrained edges of Ct{M). 

The distinction among the different types of critical areas is done by analyzing 
the coordinates of the vertices which are edge- adjacent to the region boundary, 
that is, by checking the ascending/descending directions, as in section 3.1. 
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According to the graph representation of the Reeb’s quotient space, each 
node of the graph corresponds to a critical point, or area, and each arc corre- 
sponds to a connected component of the manifold between two critical levels of 
the height function. This suggests to follow a similar approach for automatically 
extracting the Reeb graph, based on tracking the evolution of contour lines. 
First, the critical areas are recognised, and the ERG is initialized by creating 
the node corresponding to the virtual minimum, VM. The UM is connected to 
the saddle (complex or simple) having the minimum elevation and external to 
each macro-node. If such a saddle does not exist, then the VM is connected to 
the first complex maximum area, otherwise the ERG is a trivial graph connect- 
ing the VM to the only simple maximum existing. Then, using the notion of 
influence zone, the Reeb graph arcs are partially computed from the adjacency 
relationships between influence zones. In this manner the arcs connected to the 
terminal nodes of the ERG are identified. To complete the ERG construction 
the links between saddles and, in general, complex areas, have to be determined. 
Intuitively, arcs correspond to ascending paths between critical areas which are 
determined by expanding the associated influence zones, following free direc- 
tions in the outmost boundary component (see figure 5). Free directions are 
those which do not correspond to an already identified arc. 

In the following, the construction algorithm is described using a C pseudo- 
code. At each step, if the critical region is multiply-connected, then a macro-node 
is defined, with as many arcs as the inner components of the critical region. Note 
that the definition of influence zone guarantees that the Reeb graph is correctly 
constructed. 

ERG_Construction(N, A) 

/* The ERG is defined by the set of nodes, N, and of arcs, A */ 

{ N=CriticalAreasRecognition(tin, contours) ; 

/* Identify critical areas and initialize the virtual minimum */ 
OrderAreas (N) ; /* Order the Critical Areas by elevation */ 

Inf luenceZoneDetection(N) ; 

/* Influence zones are associated to saddle areas */ 
and then to maximum and minimum critical areas */ 
LinkNodesbyInf luenceZone(N, A) ; 

/* Create a subset of A directly by checking the adjacencies 
between Influence Zones */ 

CompleteArcSet (N, A) ; } 

For the sake of clarity the function “CompleteArcSet” is here expanded: 

CompleteArcSet (N , A) 

/* N=nodes, A=arcs */ 

{for (each node in N) 

{if (IsGrowingArea(node) ) 

{for ( each non visited growing direction node) 

{while ( (not (f indBoundarySurf ace) ) or 

(not (f indOtherInf luenceZone) ) ) 
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ExpandToUpperLevel(node) ; 
if (R=OtherAreaReached) 
ConnectWithArc (node , R) ; 
} } > } 



The function “IsGrowingArea(node)” returns a boolean value which is TRUE 
if the critical area has at least one growing direction which has not been visited 
yet. 




(a) (b) 

Fig. 5. Two steps of the Reeb graph reconstruction process 



Some examples are given: in figure 6(a), a test surface is depicted with the 
critical areas identified on the triangulation; its Reeb graph is depicted in 6(b) 
and 6(c). The surface that can be trivially reconstructed by triangulating only 
the critical sections of the ERG is shown in 6(d): even if rough, the reconstruction 
still faithfully reproduces the original surface morphology. Finally, in figure 7, 
another example of our characterisation method applied to a natural surface (a) 
is shown, then the reconstructed Reeb graph is depicted in (b) and (c) and the 
surface reconstructed considering only the critical sections is shown in (d). 

4 Conclusions and Future Developments 

In this paper a model has been defined which provides a high-level description of 
the surface shape. Morse theory concepts and Reeb graph properties have been 
used and translated to the discrete domain in order to define a sketch of the 
surface usable for classification or compression and decompression purposes [18]. 

With respect to previous work in this field, the proposed Extended Reeb 
Graph is able to faithfully represent the surface shape, even in case of degen- 
erate critical points. The algorithm for constructing the ERG of scalar fields, 
represented as contour lines, is efficient and allows a topologically correct recon- 
struction of the surface shape. 
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Fig. 6. A test surface (a), its Reeb graph (b) and (c) and the reconstructed 
model (d) 



Fig. 7. The critical areas of a terrain (a), its Reeb graph (b), (c) and the re- 
costruction from the critical sections (d) 
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Future developments are currently under consideration, mainly for the ex- 
tension of the method to three dimensional surfaces, with or without boundary. 
Moreover, the shape decompression process is also being developed, taking into 
account several shape reasoning steps which allow a better definition of shape- 
based generation of intermediate sections between critical sections, coded in the 
Reeb graph [18]. 
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Abstract. In this work we describe a geometric method to smooth the 
boundary of a discrete 3D object. The method is reversible in the sense 
that the discrete boundary can be retrieved by digitizing the smoothed 
one. To this end, we propose a representation of the boundary of a dis- 
crete volume that we call Euclidean net and which is a generalization to 
the three-dimensional space of Euclidean Path introduced by Braquelaire 
and Vialard [4]. Euclidean nets can be associated either to voxel based 
boundaries or to inter- voxel based boundaries. In this paper we focus on 
the first approach. 

Keywords: Digital Surfaces. Discrete boundary representation. Smooth- 
ing. 



1 Introduction 

Processing three-dimensional discrete images such as X-ray computer tomogra- 
phy images or MRI brings up the problem of defining and representing three- 
dimensional discrete objects [11]. Usually a three-dimensional diserete image (in 
short 3D image) is a parallelepipedic matrix of voxels and a three-dimensional 
diserete objeet (in short 3D object) of a 3D image is defined as a set of con- 
nected component of voxels of this image. In this work, we call diserete objeet, 
or simply objeet, any 6-connected set of voxels. For instance a raw visualization 
of a discrete sphere (a maximal object which voxel centers are inside a Euclidean 
sphere), also called cuberille representation [10], can be seen in Fig 1 (a). 

There are several motivations to build an effective representation of the 
boundary of a 3D object. One of them is that a boundary representation fits 
with conventional rendering techniques. Another one is that the encoding of a 
3D object with a matrix of labels does not provide a convenient framework for 
image analysis applications such as the extraction of geometric and topological 
feature or object recognition. 

A first approach, known as the marehing eubes algorithm [13], provides a 
method for extracting a polygonal surface mesh from the object. Each polygon 
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is defined according to a local configurations of height adjacent centers of voxels. 
This method is easy to implement but has several drawbacks both from the 
geometrical and the topological points of view. A second approach known as 
active surfaces consists in moving the control points of a parametric surface 
enclosing the digital object in order to minimize an energy function [20,21]. 
With both these approaches the boundary of the discrete object is described by 
a continuous surface defined in the Euclidean space 

An alternative method is to use the digital surface (or discrete surface) of the 
object. Two approaches are commonly used : the voxel based approach [15,7,14,22] 
and the inter-voxel based approach [17,8,4]. It has been shown that the inter- voxel 
surface can be seen as a marching-cube algorithm define with an alternative in- 
terpolation scheme [12]. 

Several methods have been proposed to associate a surface mesh with the 
discrete surface of an object. For instance the set of centers of surface voxels can 
be triangulated by linking together some adjacent centers [24,22]. In Fig.l (b) 
is displayed a wire-frame representation of the surface mesh computed by the 
method proposed by Thuermer [22], and in Fig.l (c) is displayed a surface ren- 
dering of this surface mesh. The aspect of the rendered surface mesh is smoother 
than the cuberille representation but is still far from the aspect of a continuous 
sphere. A usual approach to improve the visualization of a digital surface is to 
estimate a normal at each point of the discrete surface and to perform a shading 
(for instance a usual Gouraud shading [9]). 

Here we propose an alternative approach based on a geometrical smoothing 
of the digital surface. In general a discrete 3D image comes from the digitizing 
of a continuous 3D scene. In this context a discrete object is also the result of 
the digitizing of a continuous one. It is thus natural to try to retrieve from the 
boundary of a discrete object an approximation of the boundary of the under- 
lying continuous object. The goal of the geometric smoothing is to approximate 
this underlying continuous boundary by performing local movings of the sur- 
face points. An example of geometric smoothing of a discrete sphere is shown 
in Fig.l (d). Each point of the triangulated discrete surface has been moved 
according to a method described in the following of the paper. 

This method is a generalization to the three-dimensional case of the method 
of Euclidean paths introduced by Braquelaire and Vialard [27,4] to smooth the 
discrete boundary of a discrete 2D region. A Euclidean path is associated to a 
discrete 4-connected or 8-connected path by moving each point of the discrete 
path from its position to any position of the open unit square centered on it. A 
method has been proposed to compute the moving of each point of the discrete 
path laying on the computation of a discrete tangent at each of these points. It 
has been shown that so called tangent driven Euclidean paths highly improve the 
visual aspect of a discrete region boundary and provides a convenient represen- 
tation for geometric transformations and a good estimation of geometric features 
(tangent and normal, perimeter) [25]. The smoothing preserve the topology of 
the set of boundaries of a segmented image [1]. This method has been successfully 
used to solve some image processing problems [3,26]. 
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Fig. 1. Four representations of a discrete sphere: (a) cuberille representation, (b) 
wire-frame representation of a surface mesh associated with the discrete surface, 
(c) surface rendering of the surface mesh, (d) geometric smoothing of the discrete 
surface 



The aim of this work is to propose a similar method to smooth the discrete 
surface of a 3D object. A geometrical analysis is performed at each point of the 
discrete surface providing an estimation of a tangent plane. The point is then 
projected onto this plane. This method is reversible in the sense that the discrete 
surface can be retrieved by digitizing the smoothed one. Moreover, this method 
computes a canonical result and does not need any manual parameterization. 

In the following section we introduce discrete and Euclidean nets which are 
the 3D analogous of 2D discrete and Euclidean paths. In Section 3 we describe 
the model of tangent driven Euclidean nets. In Section 4 we address the problem 
of computing both a tangent plane an a normal vector to a point of a discrete 
surface. Einally in Section 5 we present both qualitative and quantitative results 
of experiments of geometric smoothing. 

2 Discrete and Euclidean Nets 

Let us now give some basic definitions used in the following. The coordinates 
of a point P are denoted by the tuple {xp^yp, zp). The set of points with 
integer coordinates is called the diserete spaee and its elements are called diserete 
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points. We denote by upper case letters the points of the discrete space I? and 
in lower case letters the points of the Euclidean space 

A voxel is a colored unit cube the center of which is a discrete point. The 
coordinates of a voxel are the coordinates of its center. An image is a set of 
voxels and the image domain is the set of voxel centers. In the following we 
consider images the domain of which is a parallelepipedic set of discrete points. 
The voxel based approach has been chosen to define the boundary of an object. 
More precisely, the diserete surfaee of an object V is the subset S(V) of V such 
that each voxel of S(V) is 6-connected to at least a voxel of the complement 
of V in the image. A voxel of *S'(V') is called a surfaee voxel of V. 

The following definitions are extensions for the three dimensional case of the 
definitions used in the model of Euclidean paths [4] . 

Definition 1. Let P = {xp^yp^ zp) be a diserete point. The cell of P is the set 
of points p o/R^ verifying: \xp - Xp\ < \yp - yp\ < \zp - Zp\ < 

The cell of a discrete point P is thus the open unite cube centered on P. 

Definition 2. Let P be a diserete point. We say that p is a Euelidean point 
assoeiated with P if p belongs to the eell of P. 

In two dimensions it is natural to define the boundary of a discrete region 
by a discrete path (the nodes of the path being either pixel or inter-pixel posi- 
tions). In three dimensions the analogous of a discrete path is a graph of discrete 
point which edges link neighboring elements of the boundary according to the 
considered topology. This graph describes the polygonal decomposition of the 
boundary. Since we consider here voxel based boundaries the edges of this graph 
link centers of adjacent boundary voxels. We do not focus in the following on the 
definition and the construction of this graph (any topological consistent defini- 
tion can be used). Our experiments have been done by using the decomposition 
described by Thuermer [22]. 

Definition 3. A discrete net is a graph assoeiated whieh a eonneeted eomponent 
of the polygonal deeomposition of the boundary of a 3D objeet. A Euclidean net 
assoeiated with a diserete net N is the graph obtained by replaeing in N eaeh 
diserete point by an assoeiated Euelidean point. 

The transform from a discrete point to an associated Euclidean one can 
be reversed simply by rounding the coordinates of the Euclidean point. The 
transform from discrete nets to Euclidean nets is thus also reversible. Since there 
is an infinity of Euclidean nets that can be associated with a discrete one, we 
must now define a strategy to associate a discrete point with to a Euclidean one. 
Recall that the goal of the geometric smoothing is to approximate the underlying 
continuous boundary. Good results were obtained in the 2D case by projecting 
the discrete point on the support of an arithmetic line tangent to the boundary. 
In the following section we describe a analogous 3D method based on arithmetic 
digital planes introduced by Reveilles [16]. 
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3 Tangent Plane Driven Euclidean Nets 



Let us first recall the following definition: 

Definition 4. [16] A discrete naive plane fin short discrete plane) is a set of 
points (x, z) of 1? satisfying the double inequality jj, < ax by cz < ja + u;, 
with a, b^c^ fi^uj ^ Z and uj = max(|a|, |6|, |c|). 

The coefficients (a, 6, c, /i) are called the characteristics of the the discrete plane. 
The vector of coordinates (a, 6, c) is the normal vector of the discrete plane. The 
coefficient fi describes the position of the discrete plane. 

Definition 5. Let V be a discrete plane of characteristic {a,b,c,ix). The lower 
and upper leaning planes of V are the real planes defined respectively by the 
equations ax by -\- cz = ju and ax -\- by + cz = y uj — 1. 

Definition 6. Let V be a discrete plane of characteristic (a^b^c^ix). The cen- 
tered plane ofV is the real plane V defined by the equation ax-\-by-\-cz = 

The upper and lower leaning planes of a discrete plane V bound the set of 
Euclidean points which can be associated with V. Thus the centered plane is 
an average plane among all the real planes associated with V. We suppose for 

the rest of this paper that 0 < a < 6 < c and 0 < c. Thus we have uj = c. The 

th 

normal vector of such a discrete plane belongs to a 48^^^ of space called standard 
simplex [6]. The other cases can be obtained by symmetry. 

The strategy used to associate a Euclidean point p with a point P of a discrete 
net can be summarized as follows : 

1. Select a discrete plane V{P) containing P. 

2. Project P onto the centered plane of P{P) such that the projected point p 
satisfies the conditions of Definition 1. 



There are several ways to select the plane P{P). It seams natural to try to 
find the discrete plane which is the better approximation of the Euclidean plane 
tangent to the underlying real boundary. We address the problem of the selection 
of a discrete tangent plane in the following section. Eor now we call tangent plane 
at P any discrete plane that contains P. 



Definition 7. Let P be a discrete point and let P be a discrete plane of charac- 
teristics {a^b^c^ii) and containing P. LetP the centered plane ofP. The canon- 



ical projection of P according to P is the point p of P satisfying Pp = a 




Erom Definition 7 we have Xp = aa^ yp = ab^ and Zp = ac. Since p ^ P and by 
considering that P is the origin of the coordinate system we have aXp+byp+cZp = 
/i-h and thus aa^ ab‘^ ac^ = /i+ Since we have supposed that c > 0, 
we finally get: 



a = 



H- 6^ + ^2 



( 1 ) 
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Theorem 1. According to the previous definitions the canonical projection of a 
point P is a Euclidean point associated with P. 

Proof. See appendix A 

4 Selection of the Tangent Plane 

The previous result states that the canonical projection of P on the centered 
plane of a discrete plane containing P is a Euclidean point associated with P. 
We have now to define a strategy to select a plane P{P) at each point of the 
discrete net. Several method have been proposed to estimate a tangent plane at 
a point of a discrete surface. In this work we have retained an approach based 
on the analysis of local configurations called tricubes. This choice is motivated 
by the fact that tricubes provide a simple and robust way to estimate a dis- 
crete tangent plane. Of course it would have been interesting to consider more 
general configurations like for instance (n, m)-cubes [29]. But in this work we 
have focused on the problem of providing a framework for reversible geometric 
smoothing in which any tangent plane method can be used. Improvement of 
methods of determination of tangent plane is out of the scope of this work. 

Tricubes were introduced by Debled [6] and are the topic of several recent 
works [5,18,29,28]. A point P of a digital plane P has exactly height neighbors. 
The tricube of P is then the set of P and of its height neighbors. The point P 
is the center of the tricube. In the standard simplex there are exactly 40 config- 
urations of tricubes. 

For each point P of the discrete net we search for local configurations of 
tricubes centered at P and included in the net. When considering not only the 
standard simplex but the whole discrete space, some points of the discrete net 
can be associate with more than one center of tricube. There is an example 
of such a case in Fig. 2. When such a case arises we compute separately a 
Euclidean point for each tricube and we retain as Euclidean point the average 
of these points. Conversely there are configurations where some discrete points 
cannot be associated with any tricubes. For instance, there are only 58% surface 
voxels of a discrete sphere of radius 3 that can be associated with a tricube. This 
amount is of 84% for a sphere of radius 9, and is close to 100% for spheres which 
radius greater than 10 voxels. When such a case arises, the point may be either 
left unmoved or treated during a second step. If the second option is retained 
the associated Euclidean point is computed by interpolating Euclidean points 
associated with neighbors of the unmoved point. 

The example shown in Fig. 1(d) has been computed with this method. Of 
course, the quality of the smoothing highly depends on the quality of the method 
used to compute the tangent plane. The method used in this work is based on the 
analysis of small neighborhoods. The geometrical smoothing can be improved by 
improving the tangent plane recognition method. 
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Fig. 2. The dark voxel of the discrete surface (a) can be associated with two 
centers of tricube (b) and (c) 



5 Experiments 

In this section we give some quantitative and qualitative results of experiments 
performed with the method of tangent plane estimation described in the previous 
section. 

Let cr be a Euclidean sphere of radius r and *5^ the associated digitized sphere 
(the set of voxels which the center is located inside the Euclidean sphere). Eor 
each point P of 5 we denote by p{P) the distance from P to the center of the 
sphere (we suppose that the center C of the sphere cr is a discrete point). We 
define the maximal geometrical error (resp. average geometrical error) by the 
maximum (resp. average) of p{P) for all the discrete points P of the boundary 
of S. In Eig. 3 (a) are displayed maximal and average errors for discrete spheres 
from radius 2 to radius 45 before and after smoothing. Discrete points that do 
not fit with a tricube are smoothed by interpolation. Both maximal and average 
errors are reduced by half. 

In Eigure 3 (b) is given the result of experiments of normal estimation. The 
normals have been estimated according to both the discrete net and the Eu- 
clidean associated net. The normal at a point of the net is computed by av- 
eraging the normals vectors of the adjacent facets. Eor each point p of either 
the discrete net or the Euclidean associated net the error on the normal is the 
angle between the vector Cp and the estimated normal vector. Both maximal 
and average errors are given for spheres from radius 2 to radius 45. The average 
error with the Euclidean net is lower than three degrees. The maximal error is 
about ten degrees for small radius and tends toward fifteen degrees when the 
radius increases. 

Naturally these results depend on the method used to determine the tangent 
plane and can be improved by using a more global method than tricubes. Eor 
instance an idea to improve the estimation of the tangent plane would be to 
consider stochastic methods using disconnected configuration of (n, m)-cubes 
((n,m)-cubes [29] are configurations that generalize tricubes). 

Some examples of geometrical smoothing can be seen in Eig. 4. Eor each 
example the image of the first column is a fiat shading of the facets of the 
discrete net and the image of the second column is a fiat shading of the facets of 
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Distance in pixels Angle in degrees 





(a) (b) 

Fig. 3. (a) Maximal (dotted lines) and average (plain lines) geometrical error 
computed on discrete (thin lines) and smoothed (thick lines) spheres, (b) Maxi- 
mal and average error of the estimated normal vectors of discrete and smoothed 
spheres 



the associated Euclidean net. The three first rows show scenes made of regular 
objects (respectively a cube slightly rotated around one of its axis, a sphere of 
radius 20 and a combination of a plane with a sphere. The examples of the two 
last rows are computed from digitized 3D objects provided by the Labomtoire des 
Sciences de Vlmage, dlnformatique et de of Strasbourg University. 

There are still visible staircase on the smoothed images that could be attenu- 
ated by improving the selection of the tangent plane. Nevertheless it is important 
to emphasize that the images of the second column show the result of geomet- 
ric smoothing without rendering improvement. The displayed surface is exactly 
the smoothed surface and can be explicitly used for extract geometric features. 
Rendering methods based on a normal estimation [23,19] can give better visual 
results. On the other hand they does not provide an explicit geometric smoothing 
of the boundary. 

6 Conclusion 

In this work we have described a new approach to perform a discrete geometric 
smoothing of the boundary of a 3D object. This method extends to the three- 
dimension case the method of Euclidean paths. Each center of voxel P of the 
boundary is associated with a Euclidean point p located inside the open unit 
cube centered on P. Given a naive plane containing P, the Euclidean point p is 
the canonical projection of P on the centered plane of the naive plane. By this 
way a Euclidean net of Euclidean points is associated with the discrete boundary. 

Any definition of tangent plane can be used to computed canonical projec- 
tions. Thus the accuracy of the Euclidean net construction (and of the smooth- 
ing) highly depends on the accuracy of the method used to compute the tangent 
plane. In this work the tangent plane at a boundary point is estimated by an- 
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Fig. 4. Flat shading of the discrete net (first column) and of the associated 
Euclidean net (second column) of 3D discrete objects : a cube (first row), a 
sphere (second row), a combination of a plane and a sphere (third row), and 
digitized real objects provided by the LSIIT (fourth and fifth rows) 
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alyzing tricubes configurations centered on this point. Thought this method is 
limited to a local analysis of the boundary around the point it gives promising 
results. The geometric smoothing must now be experimented with other meth- 
ods of determination of tangent plane. Both quantitative and qualitative results 
could then be improved. 

We are currently working on Euclidean nets of inter- voxel boundaries. An 
advantage of inter- voxel boundaries is that they provide a simple and consistent 
representation of the topology of the object they bounds (the discrete net is 
trivially obtained from the data structure). On the other hand from a geometrical 
point of view the inter-voxel boundaries provide a quite poor representation of 
the shape of the object. In that case the relevance of the Euclidean net approach 
is thus increased. 

The method has also to be adapted to the case of a non-manifold boundary 
resulting from the segmentation of an aggregate of 3D discrete objects [2]. Finally 
we think that Euclidean nets will provide and efficient framework to extract 
geometrical parameters from a 3D object as the method of Euclidean path does 
for 2D regions [25]. 
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A Proof 

We have to verify that the canonical projection p satisfies the condition of Defi- 
nition 2. We can suppose that the origin of the coordinates system is the discrete 
point P. Since a < 6 < c, in order to show that the point 

^ f + ^ + ^ + ^ A 

^ \ -h -h H- 6^ -h + h‘^ J 

belongs to the unit cell centered at P it suffices to verify that: 

C ^ + ^ <1 

2 

The discrete tangent plane V is the set of points satisfying p < axPhyPcz < 
/i + c — 1. From the hypothesis the point of coordinates (0,0,0) belongs to P. 
Thus we have from the definition of discrete plane /i < 0 and thus 

/i < 0, 

2/i < 1, 

2/i + c — 1 < c, 
c(2/i + c — 1) < 

a2 + ft2 + c2 2{c 

Finally, since P P > (? 

/i T 1 

^ o? Pb‘^ P ^ 2 ’ 



since c > 0, 



^2 pb‘^ p c^) 



since Pb^ P > 0. 



we have and thus we get: 



By a symetrical argument from 0</i + c— Iwe get: 
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Abstract. In this paper we study discretizations of objects in higher 
dimensions. We introduce a large class of object discretizations, called k- 
discretizations. This class is natural and quite general, including as spe- 
cial cases some known discretizations, like the standard covers and the 
naive discretizations. Various results are obtained in the proposed gen- 
eral setting. 

Keywords: Object discretization. Supercover, Standard cover. Naive 
discretization, /c-Discretization 



1 Introduction 

With the present paper we undertake an investigation aimed at providing a 
general framework for developing a discrete geometry of surfaces and bodies of 
arbitrary dimension. We define and study certain discretizations that generalize 
some well-known discretization schemes, applicable mainly to linear objects (e.g., 
hyperplanes). Among them, one can list some recent studies on supercovers and 
standard covers of hyperplanes and affine subspaces [3,1,2], as well as earlier 
discretization schemes (see, e.g., [18,20]). 

One may expect that the theoretical and practical worth of multidimensional 
discrete geometry will be increasing. An argument in support of such an expec- 
tation is provided by some recent applications. In fact, 4-dimensional imaging 
is already existing, due to, e.g., PET scans and other dynamic medical images. 
Further applications are to be expected in multimedia where hypertexts together 
with images, motion video and sound are manipulated. Another motivation is 
inspired by mathematical arguments. For every Euclidean notion we may have 
numerous corresponding discrete notions, depending on the applications and 
the Euclidean property we want to preserve. The great diversity of possible 
discretizations is due to their discrete nature. If we consider objects and proper- 
ties (especially topological properties) only in 2D or 3D, we will not necessarily 
choose definitions that would have natural extensions in higher dimensions later 
on, in case we need to go to higher dimensions (for animation, for instance). In 
other words, a geometry that is not coherent in all dimensions cannot serve as a 
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good ground for future developments. More general results in higher dimensions 
will give insight to the practically important 2D and 3D versions of the theory, 
and may facilitate the deeper understanding of the matter. Thus, developing a 
reasonable n-dimensional discrete geometry is not simply an academic exercise, 
but is dictated by real needs. 

We rely implicitly on the existing knowledge in the digital topology (see, e.g., 
[15,16,14,8]). In particular, we conform to certain widely adopted terminology, 
which we briefly recall in Section 2. In Section 3 we define a minimal cover of a 
body and study some basic properties of such covers. In Section 4 we introduce 
the fundamental concept of k- discretization and k- cover of surfaces. We also 
expose a relation between the /c-discretizations and some other discretizations 
based on Hausdorff distance [21,19]. In Section 4.3 we present some results about 
connectivity of /c-discretizations, while in Section 4.4 we illustrate some of the 
new notions and results in the case of hyperplanes and halfspaces. In particular, 
the 0-discretizations of hyperplanes appear to be standard discrete planes. The 
latter have been studied in the framework of the arithmetic geometry [2,17]. 

2 Preliminaries 

We recall some basic notions to be used in the sequel. To obtain more details 
about some of them, the reader is referred to [9]. 

A discrete (resp. Euclidean) point p is an element of (resp. R^). Through- 
out we will assume that n > 2. By pi we denote the Th coordinate of p. A 
discrete (resp. Euclidean) object is a set of discrete (resp. Euclidean) points. 

Let p e Z^. A voxel with center p is the unit cube V{p) = [pi — + 

|] X ... X [pn“ |,Pn + ^]- A j-dimensional facet of V{p) will be referred as to 
a j-facek 0 < j < n^. We also define V{A) = Up^AE(p), i.e., V{A) = {x G : 
X ^ V{p) for some p G A}. 

The Euclidean points p = (pi, • • • ,Pn) and q = (<?i, • • • , <?n) are said to be k- 
neighbors^ with 0 < k < nA^Pi~Qi\ ^ 1 for 1 < i < and k < n~Y^^=i \Pi — Qi\- 
A k-path in a discrete object A is a sequence of discrete points from A such that 
every two consecutive points are /c-neighbors. A is called k- connected if there 
is a /c-path connecting any two points of A. A k- component is a maximal k- 
connected subset of A. 

Let D he 8i subset of a discrete object A. If A — D is not /c-connected then 
the set D is said to be k- separating in A. 

Now we introduce some other notions. Let M C R^ be a Euclidean object. 
Eor technical convenience throughout the paper we will assume that M is a 
closed set. A point p G R^ is called boundary point for M if it is neither internal 
nor external for M. The set of all boundary points for M is called boundary of M 
and denoted Bd{M). Since M is closed, Bd{M) C M. 

A cover of M is any set of voxels C{M) whose union contains M. The su- 
percover of M is the set S{M) of all voxels which are intersected by M. Clearly, 

^ Thus the 0-facets are the voxel vertices, the 1-facets - its edges, the (n — l)-facets 
are the voxel faces, and the n-facet is the voxel itself. 
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a) 




b) 



Fig. 1. a) Example of an object that is a body in R‘^. The voxels vi and V 2 are 
osculant for the object, b) Example of an object that is not a body. It has no 
osculant voxels 



each of the following inclusions is possible for a cover C{M) and a supercover 
S{M): (i) S{M) C C{M); (ii) C{M) C S{M). In our study we will be interested 
in covers of Type (ii), which are “tighter” than or identical to supercovers. We 
will consider covers and other discretizations that are not necessarily covers, of 
Euclidean sets with topological dimension h, where 0 < h < n. In particular, 
we will consider covers of bodies. We will say that a closed set B C is di 
body if it is simply connected^, has a topological dimension dim{B) = n, and 
for every point p G Bd{B), B — {p} is a simply connected set. For example, 
the set in Figure la is a body in while the one on Figure lb is not. We 
will also study discretizations of surfaces of different dimension, in particular, 
(n — 1) -dimensional surfaces. We will suppose that the surface is given in explicit 
form T = {x G R^ : Xn = /(xi, . . . ,x^_i)}, where / is a continuous function 
defined on R^~^. 

We will call a voxel V (p) osculant for a Euclidean object M if V{p) n M is 
contained in a j-facet of V(p), for 0 < j < n (Figure la). An n-bubble in R^ is 
a set of 2^ voxels sharing a common vertex Q. Equivalently, an n-bubble can be 
considered as the supercover of a half- integer point, namely the point Q in the 
alternative definition. 

We conclude this section with the following lemma, which will be used in the 
next section. 

Lemma 1. Let M C be an arbitrary connected Euclidean set. Then S{M) 
is (n — 1) -connected. 

A proof can be obtained as a straightforward generalization of the proof for 
the case n = 3 [9]. For a detailed accounting on various other properties of 
supercovers we refer to [1]. In particular, it is well-known that a supercover is 
not, in general, 

^ We recall that a set M C R^ is called simply connected if it is homeomorphic to the 
unit ball in R^. 
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3 Minimal Covers 

In this section we consider a natural and important class of covers. 

Definition 1. Let M be a Euelidean objeet. We eall its eover C{M) minimal if 
there is no other eover of M that is a proper subset of C{M). 

Next we study some properties of the minimal covers. 

Proposition 1. If a eover of an objeet M does not eontain any oseulant voxel, 
then it is minimal. 

Proof Let C{M) be a cover of M that does not contain osculant voxels, and 
let V be an arbitrary voxel from C{M). By definition of osculant voxel, the 
intersection My = v n M is not contained in a facet of v, and therefore is also 
not contained in a facet of any other voxel. Let us remove v from C[M). The 
remaining set of voxels C[M) — {'c} does not contain My, since a set cannot 
be contained in two distinct voxels at the same time, unless it is contained in a 
common facet of theirs. Thus C{M) — {r’} is not a cover of M. Hence the cover 
C{M) is minimal. □ 

In the case of bodies, the condition of the above theorem completely charac- 
terizes the minimal covers, as the following theorem suggests. 

Proposition 2. A eover of a body (not neeessarily bounded) is minimal if and 
only if it does not eontain any oseulant voxels. 

Proof One direction of the proof follows from Proposition 1. We prove the 
other direction. Let 5 be a body and C{B) its minimal cover. Assume that 
C{B) contains a voxel v which is osculant for B. Then we have By = vHB C F, 
where T is a facet of v. 

If C{B) contains another voxel v' ^ By, then C{B) — {t’} is still a cover of B 
and thus C{B) is not minimal. Let us assume that By is not contained in any 
voxel of C{B) other than v. Note that By cannot be a single point. The opposite 
would be possible either if By belongs also to another voxel (one containing the 
rest of By or a part of it) or if B = By, which is impossible since 5 is a body. 

We distinguish between two cases. Let us first assume that By = B. Since By 
is contained in a facet of v with topological dimension less than or equal to 
n — 1, the same follows for the dimension of B, which contradicts the fact that B 
is a body. The other possibility is B = By U B' for some set B' ^ such 
that ByHB' = 0. Since By and B' are disjoint and B is simply connected. By is 
simply connected, as well. Since 0 < dim(5^) < n — 1, every point of By belongs 
to Bd{B). Then there will be a point p G By such that the set By — {p} is not 
simply connected, which contradicts the definition of body. □ 

Proposition 3. A body B has a unique minimal eover. 

Proof Consider the supercover S{B) of B, and let us remove from it all voxels 
which are osculant for B. Having in mind the proof of Proposition 2, we can 
conclude that the obtained set of voxels is a minimal cover, which is clearly 
unique. □ 
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Theorem 1. The minimal cover of a body B is {n — 1) -connected. It can he 
obtained from the supercover of B by removing all osculant voxels for B. 

Proof Let S{B) be the supercover of a body and let us remove from it all the 
voxels that are osculant for B. Denote the resulting set of voxels by C{B). By 
Propositions 2 and 3, C{B) is the unique minimal cover of B. We will show that 
V{B) is (n — l)-connected. For this, let us consider the set Cl{Bs) C which is 
the closure of the set B^ obtained from B by excluding the union of all the closed 
balls with diameter e which are contained in B and have a single common point 
with the boundary of B (see Figure 2). Since 5 is a body, for e small enough 
(e.g., 5 < 1/m for enough large integer m) B^ will be a body as well, and thus 
a connected subset of . It is also clear that for the so defined e, the set C{B) 
will be a supercover of B^. Then, by Lemma 1, C{B) is (n — l)-connecded as a 
supercover of a connected set. □ 
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Fig. 3. a) The curve has two minimal covers: {ni,U 2 ,n 4 } and {vi^vs^v^}. Each 
of them contains a voxel that is osculant for the curve {v 2 and us, respectively). 

b) The segment on the figure has a minimal cover that is not (n — l)-connected. 

c) The minimal cover of the curve contains a 2-bubble 



With surfaces. Propositions 2, 3, and Theorem 1 do not hold, in general, as 
illustrated in Figures 3a and b. A minimal surface cover may contain osculant 
voxels (Figure 3a), as well as bubbles (Figure 3c). 
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Fig. 4. Illustration to Definition 2. In dimension 2: al) 0-tunnel; a2) 1-tunnel. 
In dimension 3: bl) 0-tunnel; b2) 1-tunnel; b3) 2-tunnel 



4 Surface Discretizations 

In this section we are concerned with discretizations of surfaces of different 
dimensions. We introduce and study a large class of discretizations called k- 
discretizations. For this purpose, we first introduce a definition of tunnels in 
discrete objects. 

4.1 Tunnels in Discrete Objects 

Bellow we suggest a definition of tunnels, applicable to a broad class of discrete 
objects. Let A ^ he a, discrete object, such that the set — V{A) is 
connected. (This condition ensures that there are no “caves” in V{A).) 

Definition 2. If the set V{A) is not simply eonneeted, we postulate that A has 
a j -tunnel, for every 0 < j < n — 1. 

0-tunnels.* If n = 2, then A has a 0-tunnel if A eontains two voxels whose 
interseetion is their eommon vertex and no other voxel from A shares the same 
vertex (Figure 4 clI)- If then A has a 0-tunnel either if A eontains two 

voxels whose interseetion is their eommon vertex and no other voxel from A 
shares the same vertex, or if there is a voxel v ^ A and a vertex P of v with 
P G Bd(V{A)), sueh that V{A) — {P} is not a simply eonneeted set (Figure 4 
bl). 

j-tunnels, 1 < j < n — 1: A has a j -tunnel if there is a voxel v G A and 
a j-faeet F of v, sueh that F C Bd{V{A)), and V{A) — {Q} is not a simply 
eonneeted set, for every point Q belonging to the relative interior^ of F . Thus 
if A is diseonneeted, it has (n — 1) -tunnels. (See Figure 4 a2,b2,b3.) 

A discrete object without /c-tunnels is called k- tunnel- free. 



4.2 Aj-Discretizations and /^-Covers 

We start with recalling the definition of Hausdorff distance. Let be a metric 
space with metric d, and S a family of closed non-empty subsets of E. For every 
X ^ E and every A ^ 8 lei d{x, A) = mf{d{x, y) \ y ^ A}. Then, given two sets 

^ Relative interior of a j-facet F is the interior of F, considered as a subset of Rf 
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a) b) 

Fig. 5. Illustration to Definition 3. a) 1-Discretization, b) 0-Discretization. It 
can be obtained from the 1-discretization by adding the voxels vi and V 2 



A^B ^ Hd{A^ B) = max{sup{d(a, B) : a G A}, sup{d(A, b) : b G B}} is called 
the Hausdorff distance between A and B. 

Let a surface B C be given, with topological dimension h, 0<h<n — 1,^ 
i.e., r = {x G . . . , t/i), 1 < i < n, G i?, 1 < j < h} where fi are 

continuous functions of the variables ... Ah- In a very general sense, we will 
consider every subset of the supercover S{B) as a discretization oi B. Of interest 
for us are discretizations that are (n — 1) -tunnel- free (i.e., without “holes”). Let 
DX) C S{B) be a /c-tunnel-free discretization of B. We will call a discrete 
point p G D{B) k- simple^ if D{B) — {p} is still A;-tunnel-free. If a A;-tunnel-free 
discretization D{B) does not contain /c-simple points, we will call it k- minimal^ . 
Now we are able to give the following definition. 

Definition 3. Let S{B) be the supereover of B, and Vk{B) the family of all 
subsets of S{B) that are k -minimal, for some 0 < k < n — 1. We will eall a 
set of voxels Dk{B) G Vk{B) a /c -discretization of B if it is (n — 1) -tunnel- free 
and the Hausdorjf distance Hd{B,V{Dk{B))) is minimal, over all the elements 
ofVk(B). 

Further we consider mainly the case when d is the Euclidean metric in R^. Some 
illustrations to Definition 3 are presented in Figure 5. We remark that in this 
definition the supercover of a surface is used as a natural upper bound for the 
amount of voxels included in a /c-discretization. 

Clearly, forl<A;<n — l,a /c-discretization of a surface is not necessarily 
a cover of its, while a 0-discretization always is. If B B Dk{B), we call the 
discretization Dk{B) a k-eover of B. One can observe that the maximal k for 

^ For example, if h = 0, F is a Euclidean point, whereas if h = 1, F is a curve in RJ^ . 
^ Traditionally, the following definitions are used. Let B be /c-separating in a discrete 
object A and A — B have exactly two connected components. A k- simple point of B 
is a point p ^ B such that B — p is still /c-separating. A /c-separating discrete object 
is called k- minimal if it does not contain any /c-simple point. The advantage of our 
version of the definition of fc-minimality is that it applies also to an object that is 
not ^-separating in another object. 
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which a /c-discretization of a surface appears to be a cover, corresponds to a 
minimal cover as defined in the previous section. 

Of a special interest are the two extreme cases k = 0 and k = n — 1 . As 
a matter of fact, 0 - and (n — l)-discretizations have been studied in the spe- 
cial cases when is a straight line or plane. Among the earliest works is the 
one of Bresenham who proposes an algorithm to discretize a straight line in the 
plane [5]. Reveilles [17] considers Bresenham lines in the framework of the arith- 
metic geometry. He shows that the Bresenham line, called also naive line, admits 
an analytical description. 3D naive planes and lines are also defined and stud- 
ied [10,12,13,11]. The naive lines/planes are proved to be the thinnest possible 
discretizations that approximate best the corresponding Euclidean lines/planes. 
It is easy to observe that they are (n — l)-discretizations. Naive discretizations 
have also been studied in the case of hyperplanes. Stojmenovic et al. [18] pro- 
pose a scheme for discretizing hyperplanes. Further related results within the 
Stojmenovic scheme are obtained in [20]. In [3] it is shown that the Stojmenovic 
discretization is an nD analog of naive discretization. As in the 2 D and 3D cases, 
a naive discretization of a hyperplane appears to be an (n — l)-discretization. 

0-Discretizations of 2 D lines and of planes are also well-studied. Such sort of 
discretizations have been defined and investigated in analytical setting [17,10], 
and are known as standard discretizations. In higher dimensions, standard dis- 
cretizations are considered in [ 2 ]. 

Definition 3 provides a structural framework for studying naive (n — 1)- 
discretizations and standard 0-discretizations for arbitrary surfaces. It also gives 
a more general view on the matter, considering the naive and the standard dis- 
cretizations as (extreme) particular cases within the class of /c-discretizations. 
One can expect that various /c-discretizations with A; 7 ^ 0 , n — 1 may be interesting 
for certain applications. 

We remark that discretizations based on the concept of Hausdorff distance 
have been proposed in [21,19]. More precisely, a Hausdorff discretization of a 
compact set K C is defined as the union of all subsets of minimizing the 
Hausdorff distance to K. One can observe that all the naive (n— l)-discretizations 
that are in a minimal Hausdorff distance from a Euclidean object contribute to 
its Hausdorff discretization. 

The Hausdorff and the /c-discretization elucidate discretization issues from 
different angle. The former one is mathematically elegant and has some interest- 
ing properties, see [19, Sections 3,4]. Besides, interesting relations hold between 
the Hausdorff discretization and other well-known discretizations. For instance, 
under certain conditions, the Hausdorff discretization is equivalent to the super- 
cover discretization defined above [19, Theorem 2]. 

On the other hand, with the study of /c-discretizations we pursue deeper un- 
derstanding of the structure of some natural discretizations (like the naive and 
the standard discretizations) within a more general setting. The /c-discretizations 
can be considered as the thinnest possible discretizations without tunnels of 
certain type, while the Hausdorff discretizations, in general, can be seen to 
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be thicker. In this sense, our approach is much in the spirit of some recent 
work [7,6,4] on thin polyhedral discretizations. 



4.3 Connectivity of /^-Discretizations 

In this section we report our basic result about connectivity of /c-discretizations 
of surfaces. 

Theorem 2. Let F C be an h-dimensional surfaee, — Ak- 

diseretization of F with 0 < k < n — 1 is max{h — 1, n — /c — l}-eonneeted. 

Idea of the Proof Let Dk{F) be a A;-discretization of F. Let v' and v" be 
arbitrary two voxels from Dk{F). Since Dk{F) is connected, there is an ordered 
set of voxels {r’l, r’ 2 , • • • , with vi = v' ^ v" = vi, such that any two consecutive 
elements 1 < i < / — 1, are at least 0-connected. To obtain a proof of the 

theorem, it suffices to show that Vi and Vi-^i are (h — l)- and (n — /c — l)-connected. 

1. {h — l)-eonneetivity. Let F be the facet of maximal dimension /, 0 < / < 

n — 1, so that F is common for Vi and If / > h — l, then Vi and 

are (h — l)-connected. Suppose that / < h — l. Let W be the set of all voxels 
that share the facet F. One can show that there is an ordered set of voxels 
W' = {vi,V 2 , . . . m < I, W C W, so that any two consecutive voxels 

in W' are (h — l)-connected. If one assumes the opposite, one can show that 
there must be an (n — l)-tunnel in Dk{F)^ which contradicts the definition of k- 
discretization. The proof is based on a detailed analysis of the structure of the 
multidimensional discrete space, and will be included in the full journal version 
of the paper. 

2. (n — k — 1)- eonneetivity. By definition, if Dk{F) is h-minimal, then it is k- 

tunnel-free. Then one can show that if Dk{F) is not {n — k — l)-connected, it 
will contain /c-tunnels. The details will be included in the full paper. □ 

The above theorem implies a number of corollaries about certain important 
special classes of surfaces. 

Corollary 1. Let F be an {n — 1) -dimensional surfaee in R^. 

(a) For any 1 < k < n — 1, a k-diseretization of F is {n — 2)-eonneeted. Thus, 
in partieular, any naive {n — 1) -diseretization is {n — 2)-eonneeted. 

(b) Any standard eover (0- diseretization) of F is {n — l)-eonneeted. 

For the case h = 1, i.e., when the surface is a curve, we obtain the following 
corollary. 

Corollary 2. A k-diseretization of a eurve in R^ is {n — k — l)-eonneeted. (In 
partieular, a standard 0- diseretization is (n — l)-eonneeted, and a naive (n — 1)- 
diseretization is 0-eonneeted, see Figure 6a.) 

In the case of a naive (n — l)-discretization we have the following corollary. 

Corollary 3. A naive {n—1)- diseretization of a h-dimensional surfaee is (h-l)- 
eonneeted. (Thus a naive {n — 1) -diseretization of an {n — 1) -dimensional surfaee 
is {n — 2)-eonneeted, in aeeordanee with Corollary 1, see Figure 6b.) 
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Fig. 6. Illustrations to Corollaries 2 and 3. a) A portion of naive 2-discretization 
of the Euclidean line between the points A = (0,0,0) and B = (11, 13, 18). It 
has 0-tunnels and is 0-connected. b) A portion of naive 2-discretization of the 
Euclidean plane — 4x — 9^ + ISz = 0. It has 1-tunnels and is 1-connected 



4.4 Applications: /c-Discretizations of Hyperplanes and Halfspaces 

In this section we illustrate the concept of /c-discretizations in the important 
case when the surface is a hyperplane. We recall that an arithmetic plane^ P = 
P(/3, oi, 02 , • • • ,<^n,^) is defined as the set of integer points which satisfy the 
double linear Diophantine inequality 0 < /3 -h Yl^=i 

Let H \ h P = 0 be a hyperplane. Let j be the least index for 

which Gj 7 ^ 0, and let aj > 0^. Then a standard cover of H has been defined as 

(I) 

It is proved [2, Proposition 13] that St{H) is indeed a cover, which is (n — 1)- 
connected, 0-minimal, and n-bubble-free. One can show that the Hausdorff dis- 
tance between H and St{H) is the minimal possible over all 0-tunnel- free dis- 
cretizations of H. This, together with the 0-minimality, implies that the above 
definition is in accordance with the global definition of a 0-discretization (stan- 
dard cover) given in the previous section. More precisely, we can state the fol- 
lowing theorem. 

Theorem 3. The set of voxels determined by Condition (1) is a 0- discretization 
(standard cover) of the hyperplane H \ h P Ym=i ~ 0* 

® Sometimes also called discrete analytical plane. 

^ Then the hyperplane H is said to have a standard orientation [2]. 



St{H) = {xeZ^ 



2^2 = 1 



<bp y^^ajXi 



< 
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This result can be extended to arbitrary /c-discretizations. As a first step we re- 
veal which arithmetic planes are /c-discretizations. In [3] the connection between 
the tunnels of arithmetic hyperplanes and their thickness is investigated, and 
the following basic result is obtained^. 

Theorem 4. [3, Proposition 9] Let P = P(6, ai,a 2 , . . . ,an,uj) = {x G Z'^\0 < 
b + Yl^=i < (^} be a diserete hyperplane, where 6 > 0, > 0 for all i, 

and Oi < Oi-^i for 1 < i < n — 1. Then, if uj < an, the hyperplane has {n — 1)- 
tunnels; For 0 < k < n, if X^r=/c+i < uj < hyperplane has 

{k — l)-tunnels and is k-separating; If uj > hyperplane is tunnel- 

free. 

The next theorem characterizes the arithmetic planes that are /c-discretizations. 

Theorem 5. Let P be a diserete hyperplane as in Theorem 4- If uJ = Yl^=k-\-i 
then P is a k-diseretization. 

The proof will be included in the full paper. The notion of surface /c-discretization 
or /c-cover can be extended to subsets of with topological dimension n. For 
this, one can use the corresponding discretization of the boundary of the given 
set. Let us consider, for instance, a half-space E determined by a linear inequal- 
ity a\Xi -h 02X2 + . . . + OnXn < b. One can obtain a /c-discretization of E as 
Dk{E) = {x e : b P Yl"i=i where uj = ^ 

dard discretization convention (see [2] for getting acquainted with the notion of 
discretization convention). In particular, the set of voxels St{E) = {x e Z'^ : 
b P Yl^=i ^ with UJ = Yl^=i ^ 0-discretization (standard cover) 

of t;. 

5 Concluding Remarks 

In this paper we have studied minimal covers of bodies and /c-discretizations of 
surfaces of arbitrary dimension. A further direction of research can be seen in 
studying minimal covers and /c-discretizations of some important special classes 
of Euclidean objects, like affine subspaces of arbitrary dimension 0 < m < n — 1 
(m-fiats), simplexes or some other interesting classes of polytopes, spheres, etc. 

It would also be interesting to study the properties of /c-discretizations for 
different choices of a metric d in the definition of Hausdorff distance. 
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® In the special case of discrete hyperplanes, a j-tunnel can be defined as follows. A dis- 
crete plane P = P{P, oi, 02 , . . . , On, cj) has a j-tunnel if there are two j-neighboring 
voxels V = (ai, . . . , Vn) and w = {wi, . . . , Wn) such that aiv\ -h . . . -h On'Cn + /5 < 0 
and OLiWi -h . . . -h OLnWn + /d > 0 [17]. It is not difficult to see that this definition is 
in accordance with the more general Definition 2. 
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Abstract. We first propose for digital surfaces a notion analogous to 
the notion of strong homotopy which exists in 3D [1] . We present an as- 
sociated parallel thinning algorithm. The surface of an object composed 
of voxels is a set of surfels (faces of voxels) which is the boundary be- 
tween this object and its complementary. But this representation is not 
the classical one to visualize and to work on 3D objects, in frameworks 
like Computer Assisted Ceometric Design (CACD). For this reason we 
propose a method for passing efficiently from a representation to the 
other. More precisely, we present a three- step algorithm to polyhedrize 
the boundary of a voxel object which uses the parallel thinning algorithm 
presented above. This method is specifically adapted to digital objects 
and is much more efficient than such existing methods [12]. Some exam- 
ples are shown, and a method to make the reverse operation (discretiza- 
tion) is briefly presented. 

Keywords: digital surface, thinning, strong homotopy, parallel algo- 
rithm, polyhedrization. 



Introduction 

To study a three-dimensional object, its surface is often used because it is less 
costy (in particular concerning the memory cost) than studying the whole 3D 
volume [2, 3, 4, 7, 8]. For this reason, the discrete representation of digital surfaces 
of 3D objects is of practical interest. Indeed, it allows us to compute easily the 
union, intersection and difference (i.e. boolean set operations) of two discrete ob- 
jects represented by their surfaces. On the other hand, classical modeling tech- 
niques use a continuous representation for surfaces. More precisely, to encode 
surfaces, a representation is often used to vizualize and to work on 3D objects: 
polyhedrons. This representation has a lot of advantages, particulary in Com- 
puter Assisted Geometric Design (CAGD) applications. However, floating points 
computations are much less relevant to perform boolean set operations. Indeed, 
there are some problems when the objects we want to work on are tangeant to 
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each other. This leads us to find a way to pass efficently and quickly from a 
discrete representation to a polyhedron representation, and conversely. 

The existing methods to polyhedrize the surfaces, except in [12], are not 
especially adapted to our initial data structure for the discrete surfaces, namely 
a set of surfels. 

The purpose of this paper is to introduce an algorithm which enables us to 
compute a polyhedrized surface from a discrete surface. 

The classical methods, using for exemple the Voronoi diagram and the Delau- 
nay triangulation [17], are not effective for the non-euclidien spaces like surfaces. 
So, we use a topological approach for the discrete surfaces (see [5] and [6]). This 
leads us to define a “topological Voronoi diagram” on these surfaces which allows 
us to built the triangular faces of our polyhedrized surface. 

In the first part, we present a parallel thinning algorithm to compute the 
skeleton of a set of surfels. This algorithm is based on an analogue for the 
surfaces of the notion of strong homotopy existing in 3D and introduced by G. 
Bertrand ([1,13]). 

Then, we present a polyhedrization method which is based on the use of a 
germ-obtained skeleton. The germs are particular surfels chosen, with respect 
to the maximal curvature, in different locations of our surface; the higher is the 
curvature, the more numerous are the germs. Then, using the parallel algorithm 
previously presented, we compute the skeleton of the set of surfels composed of 
our surface from which the germs are removed. The obtained skeleton enables 
us to draw triangles which will be the faces of the polyhedrized surface. These 
triangles are built during a cover of the skeleton. Since the germs are more 
numerous where the surface is very curved, the number of faces is higher in such 
locations. Some examples of the polyhedrization algorithm results are given. 

The presented method, which can also be used for (non conservative) data 
compression of binary 3D images, is much faster than the method introduced 
in [12]. Moreover, it produces more regular faces, which are all triangles, which 
makes it much more convenient for CAGD applications. 

A method to make the reverse operation (discretization) is briefiy presented. 
It is based on a particular data structure: an array the cells of which contain 
chained lists of surfels. Note that the boolean set operations can easily be per- 
formed using this data structure. 

1 Notations and Basic Notions 

Let X and E be two sets such as A C We denote by X the complement of X 
in E and by card{X) the number of elements of X. In the sequel, an adjacency 
relation on X is an antirefiexive symmetric binary relation on X. Given a an 
adjacency relation, an a-path in X with length I is a sequence (xq,xi, . . . ,xi) 
in X and such as Xi-i is a-adjacent to for i = 1, . . . , L An a-path is called 
closed if xq = xi and simple if the Xi are pairwise distinct. Two elements y and y' 
of X are said to be a-connected in X if there exists an a-path (xo,xi, . . . ,x^) 
contained in X such disy = xq and y' = xi. The a-connectedness is an equivalence 
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relation and its equivalence classes are called a- connected components. We denote 
by Caipha{X) the set of connected components of X and X is said to be a- 
connected if card{Ca{X)) = 1. 



1.1 Surface of a 3D Object 

A 3D object O is a set of points in These points, which we can represent as 
unit cubes centered at a point (z, k) G Z^, are called voxels. Two voxels are said 
to be 6- adjacent if they share a face, 18- adjacent if they share a face or an edge. 
We derive from this as above the classical notions of an n-path and n- connected 
component with n G {6, 18}, and we denote by Cn{0) the set of n-connected 
components of the object O. We define the n-neighborhood of a voxel x denoted 
by Nn{x) by Nn{x) = {y ^ Z^/y is n-adjacent to x}. 




Fig. 1. Examples 




Now, a surf el is a couple {v\,V 2 ) of 6-adjacent voxels v\ and V 2 . A surfel 
may be seen as an unit square shared by two 6-adjacent voxels, and then we call 
surface of an object O C Z^ the set of all surfels such as t’l G O and V 2 G 

O. We can define adjacency relations between surfels, and these relations depend 
on the one we consider on voxels. We consider the surface X of an object O. Now, 
we call e- adjacency the adjacency relation on surfels considering the edges. 
Under this one, a surfel has exactly four neighbors, one per edge. For example, 
let call X the grey surfel pointed by a full arrow on the object of the figure 1, 
(b). If we consider the 6-adjacency, the object is composed of three 6-connected 
sets (the three voxels) , and the e-neighbors of x are the four surfels of the lower 
voxel which share an edge with x. By contrast, if we consider the object with the 
18-adjacency, the three voxels are 18-connected and the four e- neighbors of x 
are the two other grey surfels and the two surfels pointed by the dotted arrow. 
We are now going to define an adjacency relation associated with the vertices 
of the surfels. First, we define a loop in X as an e-connected component of the 
set of all surfels in X which share a given vertex (figure 1). One vertex may 
define two loops (see figure 1, b) and c)), so the loops are a way to duplicate 
formaly vertices. Thus, two surfels are called v- adjacent if they belong to a 
common loop. Then, e and v are adjacency relations on X from which follow the 
classic notions of n-paths^ n -neighborhoods^ and e-connected components^ with 
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Fig. 2. Counter-example Fig. 3. A decentred skeleton 





Fig. 4. No parallel strategy 



n G {e, v}. The kind of surfaces defined above satisfies the Jordan property [11], 
namely a e-connected component of a surface separates the space into two parts: 
a 6-connected and a 18-connected one [16]. 

Afterwards, we will assume that each loop of our surface is a topological disk. 
In other words, two t’-adjacent surfels which are not e-adjacent cannot belong 
simultaneously to two distinct loops. The object of the figure 2 does not satisfy 
our hypothesis because the two loops defined by the marked vertices contain 
surfels which are r’-adjacent but not e-adjacent. (see [9] for a short discution on 
this hypothesis). 



1.2 Neighborhood Graph of Surfels 

Let X G A, n G {e, v} and X C A be a set of surfels. We introduce the neighbor- 
hood graph of surfels of x as follows. We have assumed that each loop in A is a 
topologic disk, but there exist some r»-neighborhoods Ny (x) which are not topo- 
logical disks (see the surfels of the two loops defined by the marked vertex on fig- 
ure 1 (b). Then we have to define an other “topology” under which Ny{x)U{x} is 
a topological disk. Two surfels si and S2 in Ny{x)U{x} are said to be ex-adjaeent 
(respectively Vx-adjaeent) if they are e-adjacent (respectively v-adjaeent) and 
are contained in a common loop which contains x. Then, we denote by Ge(x,X) 
(respectively Gy(x,X)) the graph whose vertices are the surfels of Ny{x) H X 
and whose edges are the pairs of e^^-adjacent (respectively adjacent) surfels 
of Ny{x) n X. The set of all connected components of Gn(x,X) which are n- 
adjacent to x is denoted by Gf{Gn{x^X)). Note that Gf{Gn{x^X)) is a set of 
sets of surfels and not a set of surfels. 

A surfel x is said to be n-isolated in X if Nn{x) H A = 0 and to be n-interior 
in A if Nn{x) nX = 9. 
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2 Topology Preservation and Thinning 

2.1 P-Simple Surfels 

First, we introduce the notion of n-simpleness of surfels [10]: 

Definition 1. [9]. Let X C X be a set of surf els of a digital surfaee X. A surfel 
X G X is said to he n-simple in X if and only if x is not n-interior in X and if 
eard{Cf{Gn{x,X)) = 1. 

Intuitively, a surfel is n-simple in X if its deletion from X does not change 
the topology of X. 

Definition 2. Let Y C X C T. The set Y is said to he (lower) n-homotopie 
to X if and only ifY ean be obtained from X by sequential deletion of n-simple 
surfels. 

Thus, we can obtain a skeleton from a set of surfels by a sequential erosion 
analogous to classical 2D sequential thining [10,9]. As in the planar 2D case [10], 
this method presents some drawbacks : there is no parallel way to test the n- 
simplicity of surfels (on figure 4, the surfels s\ and 52 of the set X composed of 
grey surfels are n-simple in X, but we cannot remove simultaneously these surfels 
without changing the topology). Moreover, the shape of the skeleton depends on 
the order with which the n-simplicity of surfels is tested (see figure 3 if we use 
the lexicographical order to test the n-simplicity of surfels). Finally, the strategy 
using end surfels (surfels which have only one n-neighbor) causes the apparition 
of a lot of undesirable branches (see figure 6). 

For these reasons, we are now interested in a new thinning method of sets of 
surfels using a parallel strategy. This method is an analogue of a parallel thinning 
algorithm existing in 3D introduced by G. Bertrand [1], and using the notion of 
P-simple points. 

First we must state some theoretical definitions and results. 

Definition 3. Let Y C X C X. The set Y is ealled strongly n-homotopic to X 
if for any Z sueh that Y C Z C X , the set Z is n-homotopie to X . If Y is 
strongly n-homotopie to X, then we eall the set S = X\Y a strongly n-simple 
set in X. 

Note that if Y is strongly n-homotopic to X, then for any set S C X\Y', the 
set X\S is always n-homotopic to X. 

Definition 4. Let X C X , P C X and x e X. The surfel x is ealled Pn-simple 
in X if, for any set S sueh that S C P\{x}, the surfels x is n-simple for X\S. 
We denote by Sn{P) the sets of the Pn-simple surfels. A set E ^ X is said 
Pn-simple in X if E C Sn{P)- 

The Pn-simple surfels satisfy a strong constraint, namely a Pn-simple surfel 
can be removed whatever the set contained in P already removed. Now, we 
establish a relationship between the notions of strongly n-simplicity and Pn- 
simplicity which is directly deduced from the definitions. 
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Theorem 1. Let Y C X C V. The set Y is strongly lower n-homotopie to X if 
and only if the set P = X\Y is Pn-simple for X. 

Now, we propose a local caracterization of the P^-simple surfels, which makes 
effective the notion of a Pn-simple surfel. 

Definition 5. We denote by An{x,X) the set of surfels defined by 

y e An{x,X) ^ 3 a n^-path in Ny{x) H X from y to an n-neighbor of x 

and we denote by Tn{x^ X) the number of Ux-eonneeted eomponents of An{x^ X). 

In other words, An{x^X) is the union of all connected components of 
Gn{x^X) which are n-adjacent to x. 

Theorem 2. Let P C X C X and x G P. We denote R = X\P. Then the 
surfel X is Pn-simple in X if and only if the four following properties are satisfied: 

— Tn{x,^ = 1 ; 

— Tn{x,X) = l; 

— \/y ^ Nn{x) n P, y is n-adjaeent to An{x^R); 

— \/y e Nn{x) ^P, y is n-adjaeent to An{x,X). 

This theorem enables us to test the Pn-simpleness of surfels by an algorithm 
using only local considerations, and enables us to define an algorithm using a 
parallel strategy. 

2.2 Parallel Algorithm and Results 

First, we present a sequential thinning algorithm of a subset X of a surface X, 
implemented by A. Lenoir [9,4]. The two-step principle is the following one: while 
there exist some n-simple surfels in X, 

— for all X in X, if X is n-simple, mark x with REMOVABLE; 

— for all surfels in X, if x is marked with REMOVABLE, and x is n-simple 
in the remaining set, remove x from X. 

To preserve the general shape of our initial set of surfels, we impose on the end 
surfels to be unremovable (i.e. surfels of X with exactly one neighbor in X). 

Now let us present our parallel algorithm (see figure 5). Only the surfels 
n-adjacent to X can be be removed from a set X at each step. So, to test 
the Pn-simplicity of the surfels in X, our set P will be the set of the surfels 
in X which are n-adjacent to X. Now, a surfel which is not P^ -simple may be n- 
simple. So, after using the parallel algorithm, we will use the sequential one to 
remove all n-simple surfels which are not end surfels. The parallel algorithm is 
presented on figure 5. 

The results of this algorithm (see figure 6) are, as in the 3D case, quite 
better than the ones obtained by the sequential algorithm. Eirst, there are less 
undesirable branches on the final squeleton obtained by the parallel algorithm. 
Next, the symmetry of this skeleton is closer to the symmetry of the initial set 
of surfels. 
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Parallel Algorithm 



M = array of all surfels; 

P = set of the surfels on the border; 

R = set of the interior surfels; 

Repeat 

made := FALSE; 

For all X in P 

If (x is P-simple) 

Then mark x with REMOVABLE; 
made := TRUE; 

End For 
For all X in P 

If (x is marked with REMOVABLE) 
Then remove x from M; 

End For 

Update P; 

While made; 



Fig. 5. Parallel algorithm 



3 Polyhedrization 

The method of polyhedrization presented here consists of three steps. 

3.1 Step One: The Choice of the Germs 

The first step consists in choosing some special surfels on the surface of the 
object we want to polyhedrize. These surfels, called germs^ are distributed ac- 
cording to the curvature of the surface: the more curved is the surface, the more 
numerous are the germs. First, using the method of A. Lenoir [3,4], we compute 
the maximal curvature of all the surfels of the surface, and we choose a surfel at 
random, which is our first germ, on the surface. Then: 

1. determination of a maximal width I depending on the maximal curvature of 
the surfel; 

2. breadth-first exploration (using a FIFO structure) of the e-adjacency graph 
from the germ until the width I is reached; we mark the covered surfels; 

3. if there are some non- marked surfels, choice of a new germ among the non- 
marked neighbors of the marked surfels, and back to the step 1. Otherwise, 
STOP. 

At the third step, for better results, the next germ is the one which has the 
greater maximal curvature. Indeed, with this choice, we obtain more faces at the 
more curvated places. 
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Fig. 6. Comparison of results 



An interesting point is that we can choose a limit width L for the breadth- 
first exploration: if the computed width I is larger than L, then we replace I 
by L. Moreover, we use a slowdown strategy at the more curved places during 
the breadth-first exploration. So we can have germs more or less dense on the 
less curved places of the surface. 

We denote by G the set of all germs thus obtained. Some results of this 
strategy are presented on figure 7. 

3.2 Step 2: Obtaining of a “Topological Voronoi Diagram” 

Let E be the surface of the object. To obtain the “topological Voronoi diagram” 
(i.e. a skeleton in which there are no more simple surfels), we use the thinning 
algorithm presented in the first section on the set E\G, without the end surfels 
conditions. We obtain the skeletons visible on figure 8 for the three previous 
presented objects. 

The skeletons are obviously more dense where the germs are more numerous. 
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Sphere, R— 15 Cork Blood vessels 

Fig. 7. The choosen germs (in white) on the surfaces of three voxels objects 




3.3 Step 3: The Obtaining of the Faces 

Now, from the skeleton and the germs, we are going to cover the triangular faces 
which will form our polyhedrized surface. 

Let consider the figure 9, (a). The germs are represented by the black points, 
and the squeleton by the thick black lines. There exist two kinds of surfels 
on the skeleton: the surfels of the branches (surfels which have exactly two 
neighbors in the skeleton) and the surfels of the intersections (surfels which have 
strictly more than two neighbors). So, we cover the skeleton by covering of the 
branches from an intersection to an other one. Let Ii and I 2 be the barycenters 
of two intersections connected by a branch. The segment hh is the edge of 
two triangles, and the other vertices of these triangles are the germs which the 
branch separates (see figure 9, (b)). 
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(a) 



(b) 



Fig. 9. Obtaining of triangular faces from the skeleton 



3.4 Results 

The results of the algorithm for the sphere with radius 15, the cork and the 
blood vessels are presented on figure 10. The initial numbers of surfels of these 
surfaces are respectively 4254, 81066, 10792 and the chosen limit widths L 4, 8 
and 4. The numbers of triangular faces obtained by the algorithm are 504, 3914 
and 1316. It takes a few seconds to compute the polyhedrized surfaces (for the 
bigger example, the cork, it takes about 30 seconds on Ultra Sparc Sun^'^ work 
stations with bi-processors). 




Fig. 10. The polyhedrized surfaces 
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As we can see on the figure, these faces are regular, well located and one gets 
good quality surfaces, even on thin objects (see the result on the blood vessels). 
The worst results are near the angles (see base of the cork), but they remain 
satisfying. 

4 Discretization 

Now, we have an effective tool to obtain a polyhedrized surface from a discrete 
one. Here, we present briefly a method to realize the reverse operation. 

First, we choose a step of discretization P (smaller than the size of the 
surface S we want to discretize). Then, we are going to work according to the 
first axis of the classical coordinates space. We define a 2-dimensionnal array 
(kP) * {kP)j with A: G N* according to the plane containing the second and the 
third axis. Each cell of this array contains a chained list of surfels (figure 11). The 
method to fill up this array is to study the intersection between the line Dij (the 
line coming from the cell (i^j) and parallel to the first axis) and S (figure 12). 
During the recover of Dij , for each intersection we have to determine by a parity 
rule if we enter into the surface or if we get out. Then, we can compute the 
coordinates of the deduced surfels and obtain the discretized surface. 




Fig. 11. Data structure 




Fig. 12. Method 



5 Conclusions and Perspectives 

Following section 2, we have obtained an analogue of the notion of P^-simplicity 
existing in 3D for discrete surfaces. Moreover, the results of thinning using 
the parallel strategy are far better than the ones deriving from the sequential 
method. 

Next, as for the polyhedrization, the obtained surfaces are rather satisfying. 
The computing time for results is quite short (a few seconds against several 
days for the method of [12]) and the obtained faces are much more regular, in 
comparison to what would be obtained by any other pre-existing method [12]. 
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Note that the boolean set operations between the objects surrounded by 
the discretized surface can easily be performed using the obtained discrete data 
structure. 

In future works, we shall include this work into a multi-scale context. Indeed, 
if we have to work with a set of multi-scaled objects, it is more relevant to con- 
sider different scales related to these objects. Thus, we will be able to represent 
the whole multi-scaled set with one method. 
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Abstract. We present a new algorithm for modeling and characteriz- 
ing shape changes in 3D image sequences of biomedical structures. Our 
algorithm tracks the shape changes of the objects depicted in the image 
sequence using an active surface algorithm. To characterize the deforma- 
tions of the surrounding and inner volume of the object’s surfaces, we 
use a physics-based model of the objects the image represents. In the 
applications we are presenting, our physics-based model is linear elas- 
ticity and we solve the corresponding equilibrium equations using the 
Finite Element (FE) method. To generate a EE mesh from the initial 3D 
image, we have developed a new multiresolution tetrahedral mesh gener- 
ation algorithm specifically suited for labeled image volumes. The shape 
changes of the surfaces of the objects are used as boundary conditions to 
our physics-based EE model and allow us to infer a volumetric deforma- 
tion field from the surface deformations. Physics-based measures such as 
stress tensor maps can then be derived from our model for characterizing 
the shape changes of the objects in the image sequence. Experiments on 
synthetic images as well as on medical data show the performances of 
the algorithm. 

Keywords: Deformable models. Active surface models, Einite elements. 
Tetrahedral mesh generation 



1 Introduction 

Today, there is a growing need for physics-based image analysis of deformations 
in image sequences (e.g. real-time MRI of the heart, image sequences showing 
brain deformation during neurosurgery, etc.). The subject has recently lead to 
considerable interest in the medical image analysis community [1,2, 3, 4, 5, 6, 7, 8]. 

Medical image analysis has in the past relied heavily upon qualitative descrip- 
tion. Today, modern applications can be enabled by providing to the clinician 
quantitative data derived from these images. For example, rather than simply 
observing erratic heart beat with real-time MRI, clinicians want to measure 
ejection fraction and estimate stress in the heart muscle quantitatively. 
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Shape- and surface-based image analysis is being increasingly used in the 
bio- medical image analysis community, e.g., for pathological analysis [9] and for 
tracking deformations [10]. Shape-based models are also being used for image 
segmentation [11,12,13] to constrain active surface models [1,14]. Such active 
surface models do not allow any physical interpretation of the deformation the 
surfaces undergo. Also, no volumetric deformation field is available. 

In an attempt to overcome these problems, several authors have proposed to 
use a physics-based model to infer a volumetric deformation field (e.g. [15,16]) 
from surface-based deformations. But the used parameters were determined 
heuristically, and one could therefore not exploit the information generated by 
the model to extract biomechanical properties. Other authors have also proposed 
to use physical deformation models to constrain a volumetric deformation field 
computed from image data using elastic [17] or even viscous fluid deformation 
models [18,19]. But in these applications, the models did not account for actual 
material characteristics, because the matching is done minimizing an energy 
measure that consists of a weighted sum of an image similarity term and a re- 
laxation term representing the potential energy of a physical body (e.g., elastic). 
Therefore, the actual physics of the phenomenon cannot be properly captured by 
these models. In order to capture a physics-based deformation field, one needs 
to use biomechanical models and image-derived forces for deforming them. 

In the context of brain shift analysis, there recently has been a significant 
amount of work directed towards simulation [7] using models driven by physics- 
based forces such as gravity. Skrinjar et al. [4] have proposed a model consisting 
of mass nodes interconnected by Kelvin models to simulate the behavior of brain 
tissue under gravity, with boundary conditions to model the interaction of the 
brain with the skull. Miga et al. [3] proposed a Finite Element (FE) model based 
on consolidation theory where the brain is modeled as an elastic body with an 
interstitial fluid. They also use gravity induced forces, as well as experimentally 
determined boundary conditions. 

Even though these models are very promising, it remains difficult to accu- 
rately estimate all the forces and boundary conditions that interact with the 
model. 

The cardiac image analysis community has been using physics-based models 
- mainly FE models - they deform with image-derived forces. These models then 
provide quantitative, and physically interpretable 3D deformation estimates from 
image data. Papademetris et al. [20] derive the forces they apply to the FE model 
from Ultrasound (US) using deformable contours they match from one image to 
the next one using a shape-tracking algorithm. Metaxas et al. [1] derive their 
forces from MRI-SPAMM data for doing motion analysis of the left or right 
ventricle [21,22]. 

In the context of deformable brain registration, Hagemann et al. [5] use a 
biomechanical model for registering brain images, but they enforce correspon- 
dances between landmark contours manually. Moreover, the basic elements of 
their FE model are pixels, which causes the computations to be very slow. Kyr- 
iacou et al. [23] study the effect of tumor growth in brain images for doing atlas 
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registration. They use a FE model and apply concentric forces to the tumor 
boundary to shrink it. In these two studies, the experiments were performed in 
2D, thereby limiting the clinical utility and the possibility to efficiently assess 
the accuracy of the methods. 

We propose to merge the prior physical knowledge physicians have about 
the object that is being imaged with the information that can be extracted 
from the image sequence to obtain quantitative measurements. We extract shape 
information of the objects in the image sequence using an active surface model, 
and characterize the changes the objects undergo using a physics-based model. 

The idea is similar to that used for cardiac analysis; we track boundary 
surfaces in the image sequence, and we use the boundary motion as input for 
a FE model. The boundary motion is used as a boundary condition for the FE 
model to infer a volumetric deformation field, as proposed in [5]. 

The main contribution of this paper is that instead of using a generic FE 
model that is fitted to the image data as it is done for cardiac image analysis [1], 
or using the pixels (or voxels) as basic elements of the FE model, we propose 
an algorithm for generating patient-specific tetrahedral FE models from the ini- 
tial 3D image in the sequence, with locally adaptable resolution, and integrated 
boundary surfaces. Also, this enables us to perform computations in 3D, without 
manual interaction, and moreover, on a limited number of elements, with equiv- 
alent precision, and in a reasonable amount of time on a common workstation, 
thanks to an efficient implementation of the FE deformation algorithm. 

2 Description of the Algorithm 

There are two important points for doing physics-based modeling of the defor- 
mation in 3D image sequences. One first needs to have a prior bio- mechanical 
model of the object represented by the image, i.e., the constitutive equations of 
the bodies (elastic, fluid, viscous fluid, etc.) represented in the image. On the 
other hand, one also needs a way of applying forces and boundary conditions to 
the model using the image information. 

In this work, we have chosen to model image structures as elastic bodies. More 
elaborate models can of course very easily be integrated into our algorithm. Thus, 
we assume that the objects that are being imaged have an elastic behavior during 
deformation. The deformations will be tracked using the boundary information of 
the objects in the image sequence. The boundary surfaces are deformed towards 
the boundaries of the next 3D image in the sequence using an active surface 
algorithm. The deformation field of the boundary surfaces is then used as a 
boundary condition for our bio-mechanical model, that will be used to infer the 
deformation field throughout the entire volume. 

This will provide us with physically realistic and interpretable information 
(such as stress tensors, compression measures, etc.) of the imaged objects during 
the whole sequence. 
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3 Mathematical Formulation 



Assuming a linear elastic continuum with no initial stresses or strains, the po- 
tential energy of an elastic body submitted to externally applied forces can be 
expressed as [24] 

E=- [ o-^e (in+ [ Fu dQ (1) 

2 JQ JQ 

where u = u(x,^,z) is the displacement vector, F = F(x,^,z) the vector rep- 
resenting the forces applied to the elastic body (forces per unit volume, surface 
forces or forces concentrated at the nodes), and i7 the body on which one is 
working, e is the strain vector, defined as 

/ du du du du du du du du 

\dx^ dy^ dz^ dx dy^ dy dz^ dx~^ dz ) 



and a the stress vector, linked to the strain vector by the constitutive equations 
of the material. In the case of linear elasticity, with no initial stresses or strains, 
this relation is described as 



a = 



^y-) '^yz-j '^xz) — 



(3) 



where D is the elasticity matrix characterizing the properties of the material [24] . 

This equation is valid whether one is working with a surface or a volume. 
We model our active surfaces, which represent the boundaries of the objects in 
the image, as elastic membranes, and the surrounding and inner volumes as 3D 
volumetric elastic bodies. 



Within a finite element discretization framework, an elastic body is approxi- 
mated as an assembly of discrete finite elements interconnected at nodal points 
on the element boundaries. This means that the volumes to be modeled need to 
be meshed, i.e. divided into elements. Our meshing algorithm will be described 
in the next section. 

The continuous displacement field u within each element is a function of 
the displacement at the element’s nodal points \if weighted by its shape func- 
tions Nf = Nf{x,y,z) (4). 



u= ^ NfyXt (4) 

i=l 

The elements we use are tetrahedra {N nodes = 4) for the volumes and tri- 
angles for the membranes {Nnodes = 3), with linear interpolation of the dis- 
placement field. Hence, the shape function of node i of tetrahedron el is defined 
as: 

Nfi^) = K(af + bfx + cfy + dfz) (5) 

^ Superscript T designs the transpose of a vector or a matrix 
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where K = for a tetrahedron, and K = for a triangle. The computation 
of (volume, surface of el) and the other constants is detailed in [24]. 

For every node i of each element e/, we define the matrix ~Bf = 

The function to be minimized at every node i of each element el can thus be 
expressed as : 



E{uf) 




+F(x)A^f (x)uf df2 



(6) 



We seek the minimum of this function by solving for 
(6) then becomes : 



dE{uf) 

duf 



0. Equation 



Nr^ 



m 



^ Bf TiW^nf df2 = - 



j = l 



nodes 



F(x)iVf (x) df2 



(7) 



This last expression can be written as a matrix system for each finite element: 

J^el^el _ _peZ 

Matrices and vector are defined as follows: K.fj = Bf df?, 

Ff = f^FNf^ dQ] where every element refers to pairs of nodes of the 
element el {i and j range from 1 to 4 for a tetrahedron - 1 to 3 for a triangle). 
K.f - is a 3 by 3 matrix, and Ff is a 3 by 1 vector. The 12 by 12 (9 by 9 for a 
triangle) matrix , and the vector F®^ are computed for each element and are 
then assembled in a global system Ku = — F, the solution of which will provide 
us with the deformation field corresponding to the global minimum of the total 
energy. 

We now have constitutive equations that model surfaces as elastic membranes 
and volumes as elastic bodies. 



3.1 Finite Element Mesh Generation 

In [5], Hagemann et al. propose to use the pixels of the image as basic elements of 
his FE mesh. This approach does not take advantage of the intrinsic formulation 
of FE modeling, which assumes that the mechanical properties are constant over 
the element, suggesting that one can use elements covering several image pixels. 
Also, when performing computations in 3D, which is eventually what is needed 
for medical applications, the amount of degrees of freedom will be far too large 
(for a typical 256x256x60 MR image, this means about 12 million degrees of 
freedom at worst case !) to perform efficient computations in a reasonable time, 
even on high performance computing equipment. 

Most available meshing software packages do not allow meshing of multiple 
objects (e.g., [25,26]), and are usually designed for regular and convex objects, 
which is often not the case for anatomical structures. Therefore, we have imple- 
mented a tetrahedral mesh generator specifically suited for labeled 3D medical 
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images. The mesher can be seen as the volumetric counterpart of a marching 
tetrahedra surface generation algorithm, the only difference being that the initial 
tetrahedralization we use can have an adaptive resolution with sizes of tetrahedra 
depending on the underlying image content. 

The labeled 3D image from which the mesh needs to be computed is first 
divided into cubes of a given size, which are further divided into 5 tetrahedra 
with an alternating pattern so as to avoid diagonal crossings on the shared 
quadrilateral faces of neighboring cubes. The initial cube size determines the size 
of the largest tetrahedra the mesh will contain. Each tetrahedron is checked for 
subdivision according to the underlying image content. In our case, we decided 
to only subdivide tetrahedra that lie across boundaries of given objects, so as to 
have a detailed description of their boundaries. The edges of those tetrahedra to 
be subdivided are labeled for subdivision, and a new vertex is inserted at their 
middle point. This process is executed iteratively until the smallest edges have 
reached a specified minimum size. 

At each iteration, the mesh is re-tetrahedrized given the required edge sub- 
divisions for each tetrahedron. The main problem is to re-mesh tetrahedra that 
lie next to tetrahedra that are being split. For those tetrahedra, only some edges 
have been split. The mesh is therefore re-tetrahedrized using a case table with 
the 2^ = 64 possible edge splitting configurations. There are 10 basic config- 
urations, the others are symmetrical to those presented on Figure 1 (the gray 
coloring and the different node labelings are represented only to facilitate visual- 
ization of the tetrahedra’s subdivisions). From upper left to lower right. Figure 1 
successively presents the tetrahedrization if one edge is split, if two edges are 
split (2 possible configurations), if three edges are split (3 possible configura- 
tions), if 4 edges are split (3 possible configurations), if 5 edges are split and 
finally if all edges of the tetrahedron are split. 




2 tets 






2 prisms 




4 tets 



1 tet + 1 prism 






4 tets 



2 tets + 1 prism 
+ 1 pyramid 




Fig. 1. Different subdivisions of a tetrahedron given edge splittings 
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The resulting mesh contains tetrahedra, but also pyramids and prisms, which 
need to be further tetrahedrized. The main problem is to ensure consistency ^ 
between the diagonals of quadrilateral faces shared by 2 elements (pyramids or 
prisms). We split the quadrilateral faces along the shortest diagonal so as to have 
better shaped tetrahedra. The subdivision of a pyramid into two tetrahedra is 
straightforward given the diagonal of the quadrilateral face. For a prism, there 
are eight possible configurations for tetrahedralization given the diagonal con- 
figuration. Figure 2 presents the different possible tetrahedralizations of a prism 
given the diagonal’s configuration. If no straight tetrahedralization is possible 
(cases 1 and 8), a vertex is inserted in the middle of the prism which is then 
divided into 8 tetrahedra. 




Fig. 2. Different subdivisions of a prism given the quadrilateral faces’ diagonals 



Finally, we apply a marching tetrahedra-like approach to generate the ac- 
tual tetrahedral mesh with accurately represented boundary surfaces. For each 
tetrahedron, the image labels at its nodes are checked. A case table draws the 
elements to be added to the mesh. If all 4 nodes have non-object labels, no tetra- 
hedron is added to the mesh. If all nodes have an object label, the tetrahedron is 
added to the mesh as is. If the tetrahedron lies across two objects (i.e. all nodes 
do not have the same label), the subdivision of the original tetrahedron is looked 
up in the case table. 

Figure 3 shows the 5 basic cases. There are actually 16 cases, but the remain- 
ing cases are symmetric to cases 2, 3, and 4. The resulting prisms are divided 
into tetrahedra using the same approach as presented above. 

The resulting mesh structure is built such that for images containing mul- 
tiple objects, a fully connected and consistent tetrahedral mesh is obtained for 
every cell, whith a given label corresponding to the object the cell belongs to. 

^ A consistent tetrahedral mesh is built such that every (non-boundary) triangular 
face is shared by exactly 2 tetrahedra. 
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AAAA 

Fig. 3. Different tetrahedral cases depicted from left to right. Case 1: all nodes 
belong to structure; case 2: 3 nodes belong to structure; case 3: 2 nodes belong 
to structure; case 4: 1 node belongs to structure; case 5: no nodes belong to 
structure 

Therefore, different biomechanical properties and parameters can easily be as- 
signed to the different cells or objects composing the mesh. Boundary surfaces of 
objects represented in the mesh can be extracted from the mesh as triangulated 
surfaces, which is very convenient for running an active surface algorithm. 




3.2 Active Surface Algorithm 

The active surface algorithm deforms the boundary surface of an object in one 
volumetric scan of the sequence towards the boundary of the same object in the 
next scan of the sequence. This is done iteratively by applying image-derived 
forces (forces computed using the surface’s nodal positions u at iteration t) 
to the elastic membrane. The temporal variation of the surface can be discretized 
using finite differences, provided the time step r is small enough. This yields the 
following semi-implicit iterative equation 



u 



u 



t-i 



Ku^ = -F" 



which can be rewritten as : 

(I + tKu^) = -tF^ 



(9) 



(10) 



The external forces driving the elastic membrane toward the edges of the struc- 
ture in the image are integrated over each element of the mesh and distributed 
over the nodes belonging to the element using its shape functions (see Eqn. 4). 
Classically, the image force F is computed as a decreasing function of the gradi- 
ent so as to be minimized at the edges of the image [14,27]. A potential weakness 
of active surface methods is that for correct convergence, the surfaces need to be 
initialized very close to the edges of the object to be segmented. In [14], Cohen 
et al. proposed to use inflation or deflation forces (so-called balloon forces) to 
circumvent that problem. To increase the robustness and the convergence rate 
of the surface deformation, we compute our forces as a gradient descent on a 
distance map of the edges in the target image. The distance map is computed 
very efficiently using our fast distance transformation algorithm [28] . To prevent 
the surface from sticking on a wrong edge, or to prevent two sides of a thin 

^ Superscript t refers to the current iteration. 
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surface from sticking together on the same edge , we have included the expected 
gradient sign of the structure to be segmented in the force expression. Also, we 
have signed the distance map for improved convergence. More details about our 
active surface algorithm can be found in [29]. 



3.3 Inferring Volumetric Deformations from Surface Deformations 

The deformation field obtained for the boundary surfaces is then used in con- 
junction with the volumetric model to infer the deformation field inside and 
outside the boundary surfaces. 

The idea is to apply forces to the boundary surfaces that will produce the 
same displacement field at the boundary surfaces that was obtained with the 
active surface algorithm. The volumetric biomechanical model will then compute 
the deformation of the surrounding nodes in the mesh. 

Let u be the vector representing the displacement of the boundary nodes to 
be imposed. Hence, the equilibrium equation of the elastic body (Eqn. 1) needs 
to be rewritten with the following external forces to impose these displacements 
to the volume : 

F = K5 (11) 

The solution of the global equilibrium system (see Eqn. 8) will provide us 
with the displacement at all the nodes in the volumetric mesh with the imposed 
displacements at the nodes of the boundary surfaces delimiting the objects rep- 
resented in the mesh. This volumetric displacement field is then interpolated 
back onto the image grid using the shape functions of every element of the mesh 
(see Eqn. 4). 

Biomechanical parameters such as the stress tensors can then be derived from 
the displacements at the nodes using the stress-strain relationship (Eqn. 3) : 

o-i = ^ T>Ci= Y. (12) 

Tet\i^Tet Tet\i^Tet 



4 Experiments 

4.1 Synthetic Image Sequence 

We have tested the algorithm on a sequence of two 3D images of an elastic sphere 
being squeezed in a given direction. The object is surrounded by another elastic 
object. 

The original active surface extracted from the volumetric tetrahedral mesh is 
shown in Eigure 4b. Note that for this experiment, the initial tetrahedralization 
from which the mesh was computed was not multi-resolution, it had constant 
tetrahedral sizes. When running the active surface algorithm, the surface readily 
converges to the boundary of the ellipsoid in the target image. 

Eigure 5 shows 3D views of the mesh associated to the initial image and 
of the mesh after deformation, while Eigure 6 shows cuts through the original 
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Fig. 4. a) Slice 30 of the target image with a cut through the initial surface of 
the object overlayed. b) 3D surface rendering of the initial surface, c) The same 
slice with a cut through the deformed surface, d) 3D surface rendering of the 
deformed surface 

(a) and deformed mesh (b) and the deformation field (c) interpolated back onto 
the image grid (downsampled for clarity) overlayed on a cut through the target 
volume. One can very well observe the physical squeezing of the sphere onto the 
ellipsoid, also deforming the surrounding elastic medium. 




Fig. 5. a) Orthogonal cuts through the initial volumetric mesh with the sphere 
extracted and b) the same with deformed mesh 



4.2 Brain Shift Analysis 

In this experiment, we wanted to characterize the deformation the brain under- 
goes during neurosurgery after craniotomy. Two 3D MR volumetric scans were 
taken before and after craniotomy and partial tumor resection, and a significant 
shift could be observed. Figure 7 shows cuts through a sample tetrahedral mesh 
of the brain overlayed on the corresponding initial image. 

The active surface is extracted from the intraoperative scan at start of 
surgery, before opening the dura mater (see Figure 8a), and deformed towards 
the brain in a later intraoperative image (see Figure 8b). Figure 8c shows the 
3D surface deformation field the brain has undergone. One can observe that the 
deformation of the cortical surface is happening in the direction of gravity and 
is mainly located where the dura was removed. Part of the shift (especially on 
the left of the picture) is also due to the tumor resection that was done between 
the two scans. 

The deformation field obtained with the active surface algorithm is then used 
as a boundary condition for our biomechanical FE model, and allows us to infer 
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Fig. 6. a) Axial cut through original mesh overlayed on slice of original image 
and b) the same with deformed mesh on target image, c) The deformation field 
overlayed on slice of target image 




Fig. 7. Axial (a) , sagittal (b) , and coronal (c) cuts through tetrahedral mesh of 
the brain overlayed on corresponding cuts through preoperative image 

a volumetric deformation field. Elasticity parameters were chosen according to 
in-vivo studies carried out by Miga et al. [3]. Figure 9a shows the obtained 
deformation field overlayed on a slice of the initial scan, and Figure 9b shows 
the same slice of the initial scan deformed with the obtained deformation field. 
Figure 9 also presents the same slice of the target scan and the magnitude of 
the difference with the initial scan showing the closeness of the alignment of the 
brain. The gray-level mean square difference between the target scan and the 
deformed original scan on the image regions covered by the mesh went down 
from 181 to 88. The remaining difference is due to the fact that the model we 
used did not incorporate the ventricular thinning and the tumor resection that 
occurred between both scans. 

5 Conclusions 

We have presented a new algorithm for tracking and characterizing shape changes 
in 3D image sequences of physics-based objects. The algorithm incorporates a 
biomechanical model of the deforming objects and uses image-based information 
to drive the deformation of our model through an active surface algorithm. One 
of the main contributions of this paper is an improved algorithm for generating 
multi-resolution patient-specific FE meshes from labeled 3D images. 
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Fig. 8. Axial cut through active surface a) initial, and b) deformed, overlayed 
on corresponding slice of intraoperative MR image, c) 3D surface rendering of 
active surface with colorcoded intensity of the deformation field 




Fig. 9. a) Volumetric deformation field overlayed on initial intraoperative image 
slice, b) The same slice of initial image deformed using deformation field, c) Same 
slice of target image, d) The difference between target and deformed images with 
same slice 

The algorithm is a promising tool for the analysis of 3D medical image se- 
quences. It will provide physicians with a tool for measurement and physical 
interpretation of deformation in 3D image sequences, and can thus be of great 
aid in in the interpretation and diagnosis of these images. 
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Abstract. An object of interest is digitized if we acquire its 3-dimen- 
sional digital images by using techniques such as computerized tomo- 
graphic imaging. For recognition or shape analysis of such digitized ob- 
jects, we need the study of 3-dimensional digital geometry and topology. 
In this paper, we focus on one of the simplest geometric objects such as 
planes and study their geometric and topological properties which are 
expressed by using an algebraic method. 



1 Introduction 

In this paper, we deal with the geometric and topological properties of digitized 
objects which are expressed by using an algebraic method. An object of interest 
is digitized if we acquire its 3-dimensional digital images by using techniques 
such as computerized tomographic imaging. For recognition and shape analysis 
of such digitized objects, topological constraints such as topological equivalence 
in use of Euler characteristics [1], skeletons [2], combinatorial manifolds [4], etc. 
are used as well as geometric constraints which are given by algebraic equa- 
tions/inequations. In this paper, we focus on one of the simplest geometric ob- 
jects such as planes and study not only their digital geometry but their digital 
topology for 3-dimensional computer imagery. 

In the context of digital geometry, an algebraic approach for the study of 
geometric properties of planes in an integer lattice space has been proposed 
in [3, 4, 5, 6, 7]. They have defined a naive/standard plane which is a set of inte- 
ger lattice points and proposed the theory using algebraic properties of a lat- 
tice space. The algebraic properties such as local configurations of points in 
naive/standard planes have been derived by their algebraic approach. 

On the other hand, we have proposed an approach based on combinatorial 
topology [8,9] for the definition and construction of topological planes in an 
integer lattice space; they are called discrete planar surfaces and constructed by 
applying our boundary extraction algorithm [8]. In this paper, we clarify the 
relations between our discrete planar surfaces and naive planes such that our 
discrete planar surface for 18- or 26-neighborhood system is a triangulation of a 
naive plane in the aspect of combinatorial topology [10]; in [4], a triangulation 
has been introduced in a similar approach for a standard plane, but not for 
a naive plane. By seeing the topological structures of naive planes as discrete 
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planar surfaces, we show their geometric and topological properties in an integer 
lattice space, which can be derived from the algebraic properties of naive planes. 

2 Definition of Naive Planes 

Let R be the set of real numbers; denotes the 3-dimensional Euclidean space. 
A plane P in R^ is defined by 

P = {(x, y,z) : ax -\- hy cz -\- d = ^} (1) 

where a, 6, c, d are real numbers. Let Z be the set of real numbers; denotes 

the set of lattice points whose coordinates are all integers. We introduce the 
definition of planes in Z^ based on algebraic approach [5]. The naive plane is 
defined with respect to P by 

NP = {{x,y,z)eZ^ : 0 < ax by cz d < w} (2) 

where w = max{\a\, |6|, |c|}. The parameter w is called the width of NP. 

3 Definition of Discrete Planar Surfaces 

3.1 Definition of Discrete Combinatorial Surfaces 

In this subsection, we introduce the definition of surfaces in Z^ based on the 
approach of combinatorial topology [10]. In Z^ we define three different neigh- 
borhoods of a lattice point x = (z, j, k) as 

^m{x) = {{p, q, r) eZ^ :{i- pf + {j - qf + {k - rf < t} (3) 

for m = 6,18,26 corresponding to t = 1,2,3. They are called 6-, 18- and 
26-neighborhoods, respectively. Depending on each neighborhood, we define el- 
ements of 1-dimensional curves and 2-dimensional surfaces in Z^. These ele- 
ments are called 1- and 2-dimensional discrete simplexes and abbreviated as 1- 
and 2-simplexes, respectively. Suppose we define 0-dimensional discrete sim- 
plexes, which are called 0-simplexes, as isolated points in Z^. Then 1- and 2- 
simplexes are defined recursively as follows. 

Definition 1. An n-simplex for n = 1,2 is defined as a set of k points in Z^, 
[xi, CC 2 , • • • , ^k] = ^ 2 ^ • • • , so that the elosed eonvex hull ofx\^ X 2 , . . . , x/c 

is one of n- dimensional minimum nonzero regions in R^ whieh are hounded by 
the elosed eonvex hulls of {n — 1) -simplexes. 

According to Definition 1, a 1-simplex consists of two neighboring points in 
TA. The configurations of those two neighboring points depend on the neighbor- 
hood systems as shown in the first line of Table 1. A 2-simplex is defined as a 
set of points whose closed convex hull is bound by a set of the closed convex 
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Table 1 . Each n-simplexes for n = 1,2 is defined as a set of points in 
for the 6-, 18 - and 26 -neighborhood systems. All n-simplexes are obtained by 
rotation and translation of those in the table 




hulls of 1-simplexes and holds a 2-dimensional minimum nonzero area. All 2- 
simplexes for each neighborhood system are shown in the second line of Table 1 . 
The constructive definitions of 1 - and 2 -simplexes are presented in [8]. 

If an ni-simplex is a subset of an n2-simplex where ni < n2, the ni-simplex 
is called a face of the n2-simplex; it is also called an ni-face. For instance, a 2 - 
simplex for the 26 -neighborhood system has three 0- faces and three 1-faces. A 
set of all faces included in a discrete simplex [a] = [xi, X2, • • • , Xk] is denoted by 
face{[a\). Let the closed convex hull of k points, Xi, a?2, • • • , be denoted by 
CH({xi, X2, • • • , ^c/c})- The embedded discrete simplex is defined as 

||a||=CH([a])\( U cum) (4) 

[b]eface{[a]) 

for any n-simplex [a], and ||a|| is called the embedded n-simplex of [a]. An n- 
simplex and the embedded n-simplex are clearly different since [a] and ||a|| are 
defined as sets of points in and R^, respectively. 

Definition 2. A finite set K of diserete simplexes is ealled a diserete eomplex 
if it satisfies the following eonditions: if [a] c K, face{[a]) C K; if [a], [b] c K 
and ||a|| H ||6|| 7^ 0, then [a] = [b]. 

The dimension of K is equal to the maximum dimension of discrete simplexes 
which belong to K. Hereafter, we abbreviate n-dimensional discrete complexes 
to n-complexes as well as n-simplexes. Suppose that K is an n-complex. If there 
exist at least one n-simplex [a] G K for every 5-simplex [b] G K such that [b] G 
face{[a]) and s < n, K is called pure. If we can find a chain of discrete simplexes 
between two arbitrary elements [c], [d] G K, [ci] = [c], [02], . . . , [ck] = [d], such 
that [ci] and [q+i], i = 1 , 2 , . . . , /c — 1 , has a common face in K, K is called 
connected. 

Definition 3 . If a 2 -eomplex K is pure and eonneeted, K a diserete eombi- 
natorial surfaee. 

More discussion on discrete combinatorial surfaces in the sense of combinatorial 
topology is given in [8]. Note that discrete complexes and discrete combinatorial 
surfaces are constructed with respect to each neighborhood system. 
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Table 2. A set 51+ (z, k) for each of eight possible configurations of black and 
white points in C(i, j, /c), m = 6, 18,26. In the table, we consider P such that 
0<a<6<c, c>0. The configurations within parentheses are ignored for the 
construction of 51+ because black points in such C(z, j, k) are regarded as 0- or 
1-faces of 2-simplexes in the adjacent cubes of C(z, j, k) 




3.2 Construction of Discrete Planar Surfaces 

The following two regions in are separated by P: 



H = {(x, z) G : ax by cz d < 0} , 

H+ = {(x, z) G R^ : ax by cz d > 0} . 

Obviously, we have 

H n H+ = P . 



( 5 ) 

(6) 

( 7 ) 



Just as H and H+ in R^, there are two regions in Z^, which are separated by 
P as follows: 



I ={(x,^,z)gZ^ : ax ^ by cz d < 0} ^ (8) 

1+ = {(x, y,z) e : ax by ^ cz ^ d > 0} . (9) 

We say that 1“ and 1+ are the digitization of H“ and H+, respectively. For both 
1“ and I+, we can construct the boundaries which are discrete combinatorial 
surfaces with the m-neighborhood system for m = 6, 18, 26, denoted by 51“ and 
51+ , using the similar algorithm for boundary extraction [8]. Both 51“ and 51+ 
are considered to be the digitization of P and called discrete planar surfaces 

with respect to P. In this subsection, we present how to generate 51+ from I+. 

The same procedure can be applied to generate 51“ if 1+ and 51+ are replaced 
by 1“ and 51“ , respectively. 
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Algorithm 1 . 
input: I+. 
output: 51^. 
begin 

1. Points in 1+ and the complement (I^)' = \I+ are assigned black and 

white points, respectively; 

2. in any unit cubic region 

C(i, j, k) = {(x, y,z)^7j^\i<x<i-\-l,j<y<jPl,k<z<kpl} 

( 10 ) 

such that C{i,j, k)nl~^ ^ 0 and C{i,j, /c)n(I+)' 7 ^ 9, the black and white 
points have either of eight different configurations as shown in Table 2; 

3. for each 0(i,j,k), dlf^{i, j,k) is obtained as a set of 2-simplexes and 
their faces by referring to Table 2; 

4 . obtain 

end 

The next theorem is derived; the proof is given in [9]. 

Theorem 1 . Each dl+ (resp. dlf^) for m = 6, 18,26 obtained from 1+ (resp. 
1~ ) by Algorithm 1 is a discrete combinatorial surface and called a discrete 
planar surface. 



4 Relations between Naive Planes and Discrete Planar 
Surfaces 

Let B+ be the set of all lattice points in 51+ for m = 6, 18, 26, such that 

B+= U [a], (12) 

[a]edl+ 

Then, the following lemma is derived. 

Lemma 1 . For any plane P, the inclusion and equality relations 

— ^18 ~ 



hold. 

Proof. Using C{i,j, k) of (10), for each m, we define 

B+(i,j,fc)=B+nC(i,j,fc) (14) 

which is a subset of B+ . Let us compare a triplet of B+ (i, j, k), m = 6, 18, 26 
for every C{i,j,k) in Z^. If we make a comparison between B^(z,j, /c) and 
B+g(z, j, k) in Table 2, we see 

B^{i,j,k) D B+g(i,j,fc) 



(15) 
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Table 3. The classification of all points in 1+ into two types with respect to 
each configuration of Table 2: a set of black and gray points which are included 
in B Jg and a set of white points which are not included in B Jg 





for configurations P4b, P5, P6 and P7, otherwise we obtain 

Bg (*, j,fc) = j,fc). (16) 

Between B^g(z, j, k) and Bjg(z, j, /c), we see 

B]*8(i, j, A;) = Bjg(i, j,fc) (17) 

for any configuration in Table 2, even if k) and 9ljg(i, j, k) are different 

for P5 and P6. From (15), (16) and (17), we see that (13) holds. □ 

We then prove the next theorem. 

Theorem 2. For any P, the equality relations 

NP = B+ = B+g (18) 



hold. 

Proof. Let us consider P such that 0<a<6<c, c>0. In this case w = c. 
From (2) we obtain 



NP = {{x,y,z) G 



a b d a b d . . . 

—X y < ^ < X y ^ 1| • (19) 

c c c c c c 



For every point x = (x, z) in NP, if we set a point Ca, G P such that 



b d, 
c c 



Cx = {x, y, 



a 

— X 

c 



( 20 ) 
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then we see that 

0 <\x — Cx\ < I (21) 

from (19). Since B^g = BJq from Lemma 1, we will show that every cc c BJq 
satisfies (21) and no point in \Bjg satisfies (21) to prove this theorem. Let us 
consider a cubic region C(i, j, k) of (10). Table 2 gives all configurations of points 
in 1+ and (I^)' for a C(z, j, k). All black points in Table 2 are classified into black, 
gray and white points in Table 3. All black points in Table 3 apparently satisfy 
(21). Let us consider the case a > 0. For each gray point x = {x,y,z)^ if we set 
two points in P such as 






.6 c d . 

( — y - -z- -^y^z) 

a a a 



( 22 ) 



and 



we obtain 



, , a c d . 

K = (x, --X- -z- -,z) , 



\x O' 3 ? I ^ \x ^ \x Cjr 



(23) 

(24) 



since \x — ax\ \x — hx\ : \x — Cx\ = 1/a : 1/6 : 1/c from Lemma 2 in Appendix 
and 0 < a < 6 < c. We then see in Table 3 that every gray point x satisfies 



\x-ax\<l or \x-hx\<l, (25) 

and from (24) we obtain (21). In the case a = 0, we will have only P2, P4a and 
P6 for configurations of C(i,j, /c) in Table 3, and gray points exist only in P6. 
If 6 > 0, we set bx and have the second inequation of (25). Thus, we also obtain 
(21). If 6 = 0, we will have only P4a in which no gray point exists. Obviously, 
no white point in Table 3 satisfies (21). From a comparison between Tables 2 
and 3, we see that a set of black and gray points in Table 3 is equal to a set of 
points of B^g in Table 2. Thus, we have (18). □ 

Theorem 2 indicates that either of 5l/g or dl^Q is a triangulation of NP in 
the aspect of combinatorial topology. From Lemma 1 and Theorem 2, we obtain 
the next corollary. 

Corollary 1. For any P, the inclusion relation 

NP C B+ (26) 

holds. 

If NP = B^, we say that dl^ is also a triangulation of NP, but if NP c B^, 
it is not obviously. If we define a naive plane such that 

NP“ = {(x, z) G : —w < ax by cz d < 0} (27) 

instead of NP and set 

= U [a] 

[a]edlm 

for m = 6, 18, 26, then the following corollary is derived. 



( 28 ) 
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Corollary 2. 


For any P, the relations 






NP = B^g = B 20 


(29) 


and 


NP“ C Bg 


(30) 



hold. 



Equation (29) in Corollary 2 indicates that either of or 812q is a trian- 
gulation of NP“ in the aspect of combinatorial topology. 

5 Properties of Naive Planes as Discrete Planar Surfaces 

In this section, we discuss the local configurations of discrete simplexes in 51^, 
and dl 2 Q with respect to P. From Theorem 2 and the properties of local 
point configurations of NP which have been introduced in [4, 5, 6, 7], we can 
derive the combinatorial properties of 51^, and they are summarized 
in Propositions 1 to 5. Let us consider the configurations of discrete simplexes in 
the parts of 51+ for m = 6, 18, 26 which project on the coordinate plane z = 0 
as a rectangle whose sizes are X x fi. 

Proposition 1. In the case o/A = /i = 2, there exist five different configura- 
tions in NP5 and the corresponding configurations of discrete simplexes for 51+ , 
m = 6, 18, 26, are shown in Fig. 1 with respect to any P such that 0 < a < b < c, 
c > 0. 

Proposition 2. At most four different configurations of discrete simplexes for 
A = /i = 2 are contained in a 51+ , m = 6, 18, 26. 

Proposition 3. In the case o/A = /i = 3, there exist fO different configurations 
in NP5 and the corresponding configurations of discrete simplexes for 51+ , m = 
6, 18, 26, are shown in Figs. 2, 3 and respectively, with respect to any P such 
that 0 < a < b < c, c>0. 

Proposition 4. At most nine different configurations of discrete simplexes for 
A = /i = 3 are contained in a 51+ , m = 6, 18, 26. 




Fig. 1. All five configurations of discrete simplexes in 51^ (a), 5I^g (b) and 51 Jg 
(c), whose projections on plane z = 0 he on the 2x2 square grids 
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Fig. 2, All 40 configuratioiis of discrete smiplext's in 3Ift whose projections on 
plane 2 : = 0 lie on the 3x3 square grids. The star of each white pohit is also 
shown as discrete siniplexes with diagonal luies in the figures 



Propositions 1 to 4 give tlie coexistence of adjacent 2-simp lexes hi a , 
ni = 6, 18, 26, There are actually two simplicial configurations for P4a in 
and for PG in as shown in Fig, 5. For each 0-siinplex [x] € 

we can define the star such that 

o-{[a:] : ai+) = {[«] E 01 + : \x] e /ace{[a])} . (lU) 

Iji Figs, 3 and 4, we choose one of the configurations in Fig. 5 for P4a or Pfi so 
that the number of 2-simp lexes in the star of a white point [x] l^ecomes as small 
as possible. The projection of (T([as] : 51 on the coordinate plane 2 : = 0 is in 
a square wliose size is 3 x 3 if 0 < o < 5 < c and c > 0, From this fact, we also 
derive the following proposition. 
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Fig. 3, All 40 canligu rat ions of discrete smiplexes in whose project ions on 
plane 2 = 0 lie on the 3x3 square grids. The star of each white pohit is also 
siiown as discrete siinplexes with diagonal luies in the figures 



Proposition 5. Any dhcrete planar surface for m = 6, 18, 26 consists of 
2-simplexes and their faces so that every O-simplex [x] E has otie of the 
stars whose configumtions are illustrated in Figs. 2, S and 4t respectively. 

We see that the equivalent simplicial configurations of a star can appear in 
different simplicial configurations each of which projects on the coordinate plane 
2 = 0 as a 3 X 3 square in Figs. 2^ 3 and 4. Thus, the total nnniber of different 
conffgurations of discrete siinplexes of a star will be less than 40, i.e. 4, 18 and 
23 configurations for the (i-, 18- and 20-neighborhood systems, respectively. The 
similar results of local configurations for dl^ are also presented in [1^4]. 
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Fig. 4. All 40 cDiifigiirations of discrete simplcxcs in <51 wliose projections on 
plane z — 0 lie on tiie 3 x 3 square grids. The star of each white pouit is aLso 
shown as discrete simplexes with diagonal lines in the figures 



6 Conclusions 

In this paper, we first proved that and B25 are cqLial to NP. From the 
equality relations, we see that for m = 18, 2G is a triaiigulation of NP in 
the sense of combinatorial topology. Since consists of discrete simplexes. 
we described the local properties of by using configurations of discrete 
simplexes instead of those of lattice points, such as the coexistence of adjacent 
2-simplexes and the configuration of discretes simplexes of a star in a . If we 
set w = |a| + |5| H- |r| in (2), we obtain standard planes with respect to P of (1), 
uistead of naive planes [4] . It is oiir future work to clarify the relations between 
standmd planes and 01^^ m = 6, 18, 26. A part of this work was supp(5rted by 
JSPS Grant-in- Aid for Encouragement of Youiig Scientists (12780207). 
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(a) 




(b) 



TSIlSI 



Fig. 5. Two simplicial configurations of P4a for m = 18, 26 (a) and P6 for 
m = 26 (b) 
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Appendix: Lemma 2 

Let us consider P of (1) such that a,b,c > 0. For each point p G I"*" \ P such 
that p = (s, t, u), we set three planes such as 



s = {{x,y,z) e R® : 


a; = s} , 


(32) 


T = {{x,y,z)en^ 


: y = t}, 


(33) 


u = {(x,y,z) G 


: z = u} . 


(34) 




Fig. 6. Three points a, b and c defined for a plane P and a point p which is not 
in P 
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Let a, b and c be the intersection points of P, T and U, P, S and U, and P, S 
and T, respectively, as illustrated in Fig. 6. Then the next lemma is derived. 

Lemma 2. For any p c 1+ \ P, we obtain 

\p — a\ : \p — b\ : \p — c\ = 1/a : 1/h : 1/c (35) 



where a, 5, c > 0. 



Proof. The equation of the line which is the intersection of P and U is given by 
ax F by cu F d = 0. Thus, the slope of the line in U is given by 



\p-b\ ^ a 
\p-a\ b ' 



(36) 



Similarly, the slopes of the intersection lines between P and T, and P and S, 
are respectively given by 



\p-c\ ^ b \p-a\ ^ c 

\p-b\ c \p-c\ a' 



(37) 



From (36) and (37), we obtain (35). 



□ 



Surface Digitizations by Dilations Which Are 

Tunnel- Free 



Christoph Lincke and Charles A. Wiithrich 



Computer Graphics, Visualization, Man-Machine Communication Group 
Faculty of Media, Bauhaus University Weimar 
99421 Weimar, Germany 



Abstract. In this article we study digital topology with methods from 
mathematical morphology. We introduce reconstructions by dilations 
with appropriate continuous structural elements and prove that notions 
known from digital topology can be defined by continuous properties 
of this reconstruction. As a consequence we determine the domains for 
tunnel- free surface digitizations. It will be proven that the supercover 
and the grid-intersection digitization of every surface with or without 
boundary is always tunnel- free. 



1 Introduction 

Various approaches have been made to study geometrical and topological prop- 
erties of binary digital images. Discussing the advantages and disadvantages of 
them would be far beyond this paper. However, there is a growing interest in 
relating these approaches to each other in order to develop a foundation for a 
mathematically consistent theory. 

The most well-known approach, known as digital topology [KR89] , is derived 
from graph theory. Elements of 7/^ are interpreted as vertices. Edges are defined 
by different adjacency relations between object and background points. This 
approach serves well for two-dimensional image analysis. The 3D case [MR8I] 
is far more complicated and a generalization to higher dimensions has not been 
made yet. 

A cellular approach has been applied by Kovalevsky [Kov89] in 2D and by 
Herman et al. [HW83] in 3D. Voss [Vos88] studied a dual cell- structure in 7'^ and 
Khalimsky [KKM90] developed a topological approach based on connected or- 
dered topological spaces. The structure studied in these approaches is the discrete 
or Alexandrov topology. It is equivalent to a tessellation of by n-dimensional 
unit cubes. Each approach maps 7^^ onto different elements of the structure. In 
the first case an element of 7'^ is associated with an open n-dimensional unit- 
cube, whereas Voss interprets 7'^ as the set of the vertices of these cubes. In the 
third approach, every element of that structure is associated with an element 
of Z^. 

Bertrand and Couprie [BC99] proposed a model for digital geometry that 
associates two orders to each subset of 7'^. These order relations correspond to 
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the different adjacencies as used in digital geometry. Moreover, the authors have 
proven that the notion of surfaces and simple points in their model correspond 
exactly to that very notions in digital topology. This justifies the original graph 
based approach for 2D and 3D space. 

Digital images can also be investigated using a digitization approach [Ser82] . 
A discrete object has a certain property if it is a digitization of an appropriate 
continuous object with that property. Dual to this is the embedding approach in 
which continuous analogs [KR85] of discrete objects are studied. Both approaches 
define properties of discrete objects by well-known continuous, usually Euclidean, 
notions. 

Reveilles [Rev92] introduced the arithmetical geometry approach. He defined 
discrete analytical objects as discrete objects which are the integer solution of 
a finite set of inequalities. Recently, Andres studied the supercover digitization 
of m-fiats in the context of discrete analytical objects [And99]. In [LWOOb] we 
generalized these results to linear analytical objects. 

The digitization approach has been related to digital topology. Various re- 
searchers [Pav82,Ser82,GL95] studied the preservation of topological features of 
continuous objects under digitization. Baerentzen et al. [BSCOO] presented a cri- 
terion for determining whether a 3D solid is suitable for digitization at a given 
resolution. Although these articles make different assumptions, the common idea 
is to consider objects that are morphologically open and closed by a closed ball 
whose radius depends on the grid resolution. 

We applied a similar approach to study the digitizations of surfaces without 
boundary [LWOOa]. Since the opening of a simple surface is the empty set, we 
employed morphologically closed surfaces with respect to a ball of a radius r. We 
called these objects r-surfaces. Contrary to Baerentzen et al. [BSCOO] our ap- 
proach did not take a reconstruction kernel into account, but this article explains 
that properties from digital geometry, such as connectivity and separability, can 
defined by a dilation of the discrete object with a continuous structural element. 

An extension of these results for surfaces without boundary to surfaces with 
boundary is essential, because many real-life objects can be described as the 
union of surface patches. To evaluate the quality of digitizations of surfaces with 
boundary Cohen-Or et al. introduced the notion tunnel-free [CK95]. This notion 
has been applied successfully to polygons and polyhedra [ANF97]. In this paper 
we develop a theoretical framework for digitizations of surfaces with and without 
boundary which is based on mathematical morphology [Ser82,Hei94]. The same 
theoretical background has been used by Schmitt to study digitizations and 
connectivity [Sch98]. 

This article is outlined as follows: Section 2 states the basic definitions from 
differential geometry, digital topology and mathematical morphology. In sec- 
tion 3 important results about surface digitizations and digitizations by dilation 
will be recalled. In the following section reconstructions by dilation will be intro- 
duced and in section 5 this notion is employed to prove a condition under which 
a surface digitization is tunnel-free. We conclude with a summary and remarks 
on future work. 
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2 Basic Definitions 

2.1 Differential Geometry 

In differential geometry continuous objects are studied as their parametrization. 
Curves are basically 1 -dimensional and surfaces are (n — l)-dimensional para- 
metrizations in W^. Thus, in curves can be considered as surfaces. 

A set of points C C (n > 2) is said to be a {I)-curve {r > 1) if there 
exists an open interval /CM and an r times continuously differentiable function 
7 : / ^ M’^ such that C = 7 (/). The function 7 is called parametrization. A 
curve 7 is smooth if, for all tel, the first derivative exists and is non-zero. A 
curve is simple if it has no self-intersection. 

From now on all curves will be considered to be smooth and simple. A curve 
C = 7 ([a, 6 ]) with end points 7 (a) and 7 ( 6 ) is a subset of curve j{I) that is 
defined on an appropriate open interval / that contains [a, b]. Let C = 7 ((a, b)) 
be a simple curve and let 7 (a) = 7 ( 6 ), then C = 7 ([a, b]) is a simple elosed eurve. 

A set of points 5 C M’^ (n > 2) is said to be a C'^ (U)-surfaee (r > 1) if there 
exists a non-empty open set U C and an r-times continuously differentiable 

function f : U such that S = f{U). 

Again, only simple, smooth surfaces with or without boundary are considered. 
A simple surfaee without boundary is either a closed surface, such as a sphere, or 
an infinite object homeomorphic to a hyperplane. These notions are intuitively 
clear and similar to those for curves. For a detailed definition the reader is 
referred to text books on differential geometry such as [LV85]. 



2.2 Digital Topology 

We define a diserete objeet A as a subset of Its complementary set = 

\ A is called the baekground. We think of Z'^ as a subset of n-dimensional 
Euclidean space An element z e Z'^ is called a grid point. 

There are various equivalent ways to introduce the basic notions of digi- 
tal topology. We define the neighborhood of grid points through Voronoi sets 
[Kle85,Wiit98]. Other definitions are based on distances or differences in the 
coordinates of these points. 

The Voronoi set V{z) of a grid point z is the set of all points in which are 
at least as close to z as to any other grid point. V(z) is a closed axes-aligned n- 
dimensional unit cube with center z. The Voronoi sets of a 2D and 3D grid point 
are known as pixel and voxel, respectively. Neighboring n-dimensional Voronoi 
sets can share a point, a straight line segment, up to an (n — 1 ) -dimensional 
cube. 

Two grid points z,z' e Z^ are said to be k -neighbors (0 < A; < n — 1) if 
their Voronoi sets share a point set of dimension k or higher, i.e. if dim(V(z) H 
V(V)) > k. 

A sequence {zq, . . . ,zi) of points of an object A C is said to be a k-are 
from Zq to zi in A if successive elements are /c-neighbors. AT C Z^ is a (simple 
elosed) k- eurve if each point of K has exactly two /c-neighbors. 
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Fig. 1. Dilation A © 5 and erosion A 0 5 of a set A by a structuring Element B 



An object A C is k- connected if there exists a /c-arc in A from z to z' 
for any points z^z' G A. A k- component of A C Z’^ is defined as a maximal k- 
connected non-empty subset of A. 

A discrete object A C Z^ is said to be k -separating if the background 7/^ \ 
A consists of exactly two /c-components. A /c-separating object A is called k- 
minimal if for any z G A A \ {z} is not /c-separating. A k-separating surface 
(without boundary) is a minimal /c-separating object. 

To avoid pathological situations in 2D, a 1-curve must consist of at least 8 
points and an 0-curve of at least 4 points [KR89] . A discrete surface should have 
no touching points. Traditionally [KR89], in Z^ 1- and 0-neighbors are called 
4- neighbors and 8-neighbors, respectively, and in Z^ 26-, 18- and 6-neighbors 
are common notions. 



2.3 Morphological Definitions 

In this article, morphological operations [Ser82,Hei94] on point sets will be re- 
quired. Let A and B be two subsets of Since Z’^ C the following opera- 
tions can be applied to continuous as well as to discrete point sets. 

A (B B = {a b : a E A, b G B} is called Minkowski addition and A Q B = 
{p : 6 + p G A for all 6 G -B} is the Minkowski subtraction of A and B. In math- 
ematical morphology, A(B B and AQ B are known dilation and erosion of A by 
the structuring element 5, respectively. Fig. 1 shows an example. 

The operation A o B = {A Q B) (B B is called the opening and A • B = 
{A(B B) Q B is called the closing of A with respect to B. A is morphologically 
open (closed) with respect to B if Ao B = A (resp. A^ B = A). 

Finally, A^ = A © {z} is the translate of A by z and A = {—a : a G A} 
denotes the reflected set of A. 

3 Surface Digitizations 

In our previous work [LWOOa] we studied a class of digitizations, commonly 
known as digitizations by dilations [Hei94]. The grid-intersection [Kle85] and 
the supercover [CK95] digitization schemes, which are common for surfaces, are 
special cases of digitizations by dilations. 
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Fig. 2. Digitization of a curve as the set of translated basic domains Dz hit by A 
(left) and as the set of grid points contained in A (B D 



A digitization by dilation with domain D C is a function ^ 

that is defined as A^{A) = {z ^ 7A : An Dz ^ 9} for every continuous 
object A C 

By virtue of this definition, a grid point z belongs to digitization Z\^(A), 
if and only if Dz, the domain translated to z G hits the continuous ob- 
ject A. A^{A) is called digitization by dilation because it is the set of grid 
points contained in the dilation of A by the reflected domain D, i.e. A^{A) = 
(A©D)nZ^ [Hei94]. 

If the domain of a digitization is not specified then A{A) is any subset of 

that is intended to serve as a discrete approximation of a continuous object 
A C It does not need to be a digitization by dilation when we focus on 
criteria for the quality of these approximations. 

The criterion “/c-separating”can only be applied to discrete surfaces without 
boundaries. To overcome this limitation the notion of a /c-tunnel-free digitization 
of a surface has been introduced [CK95]. 

Let (zo, • • • 1 zi) be a k-arc. Then the continuous polygonal arc consisting of 
the straight line segments [zq, ^i], [^i, ^ 2 ], • • • , [^i-h z{\ is called a continuous k- 
path. A continuous path tt hits a surface S C in a point p^S if pGirHS. 
A continuous path tt crosses a surface S C in p G 5 if there exists an e > 0 
such that TT hits two different components of Be{p) \ S. Be{p) denotes the closed 
ball of radius e with center p. 

A digitization A[S) C Z^ of a continuous surface S C is k-tunnel-free 
(0 < A; < n — 1) if every continuous /c-path in (Z\(5))^ = ZA \ A{S) does not 
cross S. A continuous A;-path in (A{S))^ that crosses S is called k-tunnel. 

If A{S) C Z'^ is a /c-tunnel-free digitization of S then Z\(*S') U A is also k- 
tunnel-free for every A C Z’^. As illustrated in Fig. 3, a /c-tunnel-free digitization 
of a continuous surface without boundary is not necessarily a /c-separating dis- 
crete object. 

4 Reconstructions by Dilation 

The foundation of digital topology is the notion “/c-neighbor” . In Section 2 k- 
neighborhood of grid points was defined by means of their Voronoi sets. In this 
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Fig. 3. A 1 -tunnel-free (left) an 0-tunnel-free (right) digitization of a simple 
closed curve, that are no discrete curves 



section will be shown that two point z^z' G are neighbored if U Rz' — 
{{z} (B R) U {{z'} (B R) is a connected set in for an appropriate structural 
element C Consequently, if A G is a discrete object, we can think of 
the continuous set (A 0 i?) C as its reconstruction by dilation. 

Let X = (xi, . . . , Xn) and y = (^i, . . . , yn) be two points in then [x, y] 
and [(xi, ... , Xn), (?/i, . . . , ^n)] denotes the straight line segment between these 
points. R^^^ G is the union of all line segments [(0, ... ,0), (xi, ... , x^)] with 
the property that at least k {0 < k < n) of the coordinates Xi are 0, while the 
others are either | or — 

For simplicity the index for the dimension will be omitted if it is clear by the 
context. Fig. 4 shows the sets Rq and R\ in and i?o, Ri and R 2 in The 
following lemma is a simple conclusion of the definitions of Rk and /c-neighbors. 

Lemma 1. Two grid points z^ z' G are k -neighbors (0 < k < n — 1), if and 
only if the reconstruction {z, z'} (B Rk 'Is a connected set in 

There exists a straight line segment between /c-neighbored two grid points 
in the reconstruction by Rk and, conversely, if there is a straight line segment 
between two grid points z and z' in {z, z'}(BRk then these points are /c-neighbors. 
Notice that there may exist other continuous paths between these points. Only 
for (n — l)-neighbors this path is unique. A /c-arc from z to z' in A C exists, 
iff there exists a continuous path between z and z' in A (B Rk- 

Lemma 2. An object A is k-connected if and only if the reconstruction 

A(B Rk 'Is a connected set in MA . 




Fig. 4. Ri and Rq in R^ and R 2 , Ri and Rq in R^ 
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Proof. Assume A C is /c-connected. Then, for any z,z' C A, there exists 
a continuous path between z and z' in A (B Rk- Rk is a connected set. Hence, 
for every z" C there exist a continuous path between z" and every point of 
{z"}(BRk- Consequently, there exists a continuous path between any two points 
in A Q Rk, i.e. AQ Rk is connected in W^. 

Conversely, suppose A(BRk is connected in By construction of Rk there 
exist a continuous /c-path between any grid points in A and A is /c-connected. 
q.e.d. 

As a result of this lemma /c-components can be defined as continuous compo- 
nents of the reconstruction by Rk- The proofs of the following two lemmas will 
be omitted. They are similarly simple. 

Lemma 3. A discrete object A C Z’^ is k-separating iff (Z^ \ A) (B Rk consists 
of exactly two continuous components. 

Lemma 4. A k-separating object A is k-minimal iff {{Z^ \ -^) C {z}) © Rk is 
not separating for all z ^ A. 

Let us conclude this section with some remarks. The reconstruction by dila- 
tion with a structural element Rk represents only the number of components of 
a discrete object. For example the reconstruction of a discrete curve or a surface 
is not a continuous curve or surface. It also does not reconstruct the number of 
background components. 

Instead of Rk we could have used other structural elements such as the convex 
hull of Rk or the smallest closed ball that contains Rk- For these examples all 
results of this section would still hold. 

We have chosen the term “reconstruction by dilation” to represent a dual 
or opposite operation of digitization by dilation. It should be pointed out that 
this term is also used in mathematical morphology with a different meaning. In 
the context of geodesic transformations there is a notion of “reconstruction by 
dilation of a mask image from a marker image” [Soi99] which is not related to 
our definition. 

5 Tunnel-Free Surface Digitizations 

In this section tunnel-free digitizations by dilations will be studied for surfaces 
with or without boundary. The first theorem establishes a sufficient condition 
such that a digitization by dilation is tunnel-free. 

Theorem 1. Let A{S) be a digitization of a surface S G MA such that the 
reconstruction of the background (B Rk does not hit S. Then A(S) is k- 

tunnel-free. 

Proof. Let us assume A{S) is a digitization of S such that {A{S))^ © Rk 
does not hit the surface S. Suppose that there exists a /c-tunnel in A{S). Then 
there exists a continuous path in (A (5))^ © Rk that hits the surface 5, which 
is a contradiction to our assumptions. 
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Note that the condition “the reconstruction of the background by Rk does 
not hit the surface” is not necessary for a /c-tunnel-free digitization. Using this 
theorem, a relationship between existence of /c-tunnels a digitization by dilation 
and the choice of the domain D can be proven. 

Theorem 2. Let A^ be a digitization by dilation with the domain D A. Rk- 
Then A^{S) is k-tunnel-free for every surfaee S . 

Proof. Let A^ be a digitization by dilation with the domain D A Rk- For 
every S C A^^{S) is a subset of A^{S). The digitization A^^{S) contains 
all points z e such that z (B Rk hits S. The construction of its background 
{A^^{S))^(BRk does not hit S. Hence, A^^{S) is /c-tunnel-free and so is A^iS). 

This theorem justifies the grid-intersection [Kle85] and supercover digitiza- 
tion [CK95] as appropriate digitizations schemes for surfaces with or without 
boundary. 

The grid-intersection digitization is a digitization by dilation whose domain 
is Rn-i- As a consequence of Theorem 2 every grid-intersection digitization of 
surfaces is always (n — 1) -tunnel- free. The domain of the supercover digitization 
is V(0), which is a superset of Rq. Hence, the supercover of every surface is 0- 
tunnel-free. 

6 Summary and Future Work 

In this article we investigated digital topology with methods from mathemat- 
ical morphology. We introduced reconstructions by dilations with a structural 
element Rk- We have proven that important notions from digital topology, such 
as /c-neighbors, /c-connected and /c-separating objects, can be defined by contin- 
uous properties of the reconstruction dilation with Rk- 

As a consequence the new notions have been used to prove that every digi- 
tization by dilation whose basic domain is a superset of Rk is /c-tunnel-free. In 
particular the grid-intersection digitization and the supercover of every surface 
is always 0-tunnel- free and (n — l)-tunnel-free, respectively. 

Currently we are relating our work on r-surfaces [LWOOa] to the results of 
this paper in order to obtain a theoretical framework for the digitization of 
surfaces with boundary. Our future work includes also an algebraic study of 
the relationship between digitizations and reconstructions by dilations on the 
abstraction level of windowing functions. 
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Abstract. The use of three-dimensional digitizers in computer vision 
and CAD systems produces an object description consisting of a collec- 
tion of scattered points in . In order to obtain a representation of the 
objects’ surface it is necessary to establish a procedure that allows the 
recovering of their continuity, lost during the data acquisition process. A 
full automatic O(n^) algorithm is presented. Such algorithm obtains sur- 
face representations of free genus objects described from a set of points 
that belong to the original surface of the object. The only information 
available about each point is its position in R^. The achieved surface is 
a Delaunay triangulation of the initial cloud of points. The algorithm 
has been successfully applied to three-dimensional data proceeding from 
synthetic and real free shape objects. 

Keywords: Automatic surface reconstruction, 3D Delaunay triangula- 
tion, 3D Modeling 



1 Introduction 

This article deals with the problem of tessellating real objects’ surface represen- 
tations in an automatic way. The free genus objects are exclusively defined by a 
set of three-dimensional points that belong to their surface. This problem can be 
found in areas such as computer vision or CAD, that accept object descriptions 
based on 3D scattered points. The input data is composed by a set of unstruc- 
tured points which represent the samples digitized from the surface of the objects 
using passive or active acquisition techniques [23]. Furthermore, they could be 
obtained from the segmentation of 3D volumetric images, although this approach 
is more frequently used in medical environments because of its high cost [20]. 
From this set of points, represented by their 3D coordinates, we are intending 
to automatically build a mesh of triangular facets produced by the proximity 
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relationships between the points of the surface. Another desirable property of 
the mesh is to be as regular as possible. So, the extracted mesh under these 
conditions will be a Delaunay triangulation. 

1.1 Related Work 

The use of 3D digitizers to capture the shape of the objects is more and more 
frequent in computer vision and CAD systems. This kind of devices make a dis- 
crete sampling over the surface of the object obtaining a description based on a 
cloud of points. However, we know few works in that deal with the automatic 
recovering of the surface continuity without more information that the points’ 
coordinates. Most of the existing techniques manage some type of additional 
information in order to characterize the cloud of points. The shapes of Edels- 
brunner et al. [10] are a good effort to formalize the concept of “shape” for the 
discussed descriptions. Several solutions require to use some kind of manipula- 
tion from a human being, making the process highly interactive [5,21]. In other 
cases, some limits to the data sampling process are fixed [22,8,1]. For example. 
Amenta and Bern restrict the sampling distance to avoid the problems that ap- 
pear with the existence of creases and corners. Other methods need to know 
the orientation or the position of the sensor during the sampling stage in order 
to recover the connectivity between the points of the surface [12,26]. Another 
approach is to use the orientation of the normals associated to each point to 
establish neighbor relationships between the samples [4]. Hoppe et al. [19] and 
Bajaj et al [3] define a signed distance function to compute its zero-set, consid- 
ering the surface as an implicit function defined over the input data. Another 
alternative approach used by some authors consists to perform a deformation 
over a reference mesh by means of an iterative process which adjust the mesh 
to the cloud of points [25,27,18]. The only known work in 3D that deals with 
the stated problem is the Attali’s one [2], where the boundary and surface ex- 
traction in two and three dimensions is intended to be formalized. In 2D she 
extracts the boundary from a Delaunay triangulation and uses the existence of 
topological relationships between neighbor points with the so-called r-regular 
shapes. However, the extrapolation of this problem to 3D can not be formalized 
in the same way so she offers a heuristic solution to extract the surfaces, but 
limited to closed surfaces. 

The contents of this paper is as follows. Section 2 describes the algorithm 
implemented to obtain the Delaunay tetrahedralization. Section 3 presents the 
algorithm that allows the computation of the objects’ surface Delaunay trian- 
gulation in an automatic way. Section 4 shows some meshes extracted with the 
proposed algorithm. Last, section 5 summarizes the paper’s main conclusions. 

2 3D Delaunay Tetrahedralization 

It is well known that there is an equivalence relationship between a Voronoi 
diagram and the corresponding Delaunay triangulation that allows changing 
from one type of representation to its dual by means of a 0{n) process [12]. 
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For our purposes, we have chosen the incremental algorithm described by 
Faugueras in [12] to obtain a Delaunay tetrahedralization of the cloud of points, 
although there are other methods to compute the Delaunay representation 
[17,11,6,9]. It is a 0{n^) algorithm that can be summarized in the next steps: 

1. Compute a Delaunay tetrahedralization of the vertex of a cube {Vi, i = 
1, . . . , 8} that contains all the points {Mj \ Mj G j = 1, . . . , p} belonging 
to the surface of the object, and the centers {Ci, i = 1,...,8} and the radius 
{Ri, z = 1, . . . , 8} of the circumscribed spheres of the computed tetrahedra. 
The centers of the spheres are the Voronoi points. 

2. If Mj is the current point to be inserted in the tetrahedralization and k 
is the current number of tetrahedra, each new point Mj will belong, at 
least, to one of the current spheres. If it would belong to p spheres, all the 
tetrahedra of those spheres must be marked in order to be deleted from the 
tetrahedralization, because the Voronoi region R{pi) is a convex polyhedron 
that contains the corresponding pi generator inside it. E.g., if dP{Mj,Ci) — 
R^ < 0 i = 1, . . . ,k then mark the i-th tetrahedron Ti . 

3. For all the marked tetrahedra, extract the list of their faces. 

4. Remove from this list all the faces that appear twice. 

5. For each non-removed face, create a new tetrahedron with Mj and insert it 
in the tetrahedralization. 

6. Delete from the tetrahedralization all the marked tetrahedra and return to 2 
until there are no more points to process. 

One of the main advantages of this algorithm is the local nature of the opera- 
tions performed from steps 3 to 6, because the point Mj deals with a reduced 
number of Delaunay tetrahedra. This allows a local and simple update of the 
tetrahedralization. 

To solve the degeneration cases of the Voronoi diagram computed with this 
algorithm, produced by numerical errors or by the own location of the points, it is 
possible to lightly modify the coordinates of the new point until the degeneration 
disappears and then insert it in the usual way. Another option is to discard this 
point from the cloud and not to include it in the tetrahedralization. 

3 Surface Extraction Algorithm 

The purpose of the surface extraction algorithm is first to remove the tetra- 
hedra of the simplicial complex resulting from the Delaunay tetrahedralization 
described in section 2 that are outside of the object, and then extract the ex- 
ternal surface as the list of facets of the simplicial complex that only belong to 
one tetrahedron. In this process it is necessary to use a heuristic to determine 
those tetrahedra that are inside of the object, because there is not a determinis- 
tic procedure to do this only knowing the Cartesian coordinates of the sampling 
points. Our proposed heuristic is to start with the tetrahedra placed inside of 
the convex hull of the objects’ cloud and remove those that offers to the outside 
any facet greatest than some given threshold, repeating the process until the 
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size of all the exterior facets is under the threshold. There are several ways to 
measure the size of a facet, for example, the perimeter, the area or the radius of 
the circumscribed circle. We have chosen the perimeter for two main reasons: 

1. From a computational point of view, it is faster and easier to compute than 
other measures. 

2. The tests performed have shown a better response of this measure than the 
others mentioned. We think that this is due to the fact that our starting 
simplicial complex is a Delaunay triangulation. As it is well known, the De- 
launay triangulation has the property that the partition obtained maximizes 
the minimum angle of the triangular facets achieved. Although this property 
approximates the triangular facets to equilateral triangles, sometimes the 
position of the samples makes very irregular triangles, so the area or the 
radius are not very good measures to clean the tetrahedralization. With this 
in mind, we are interested to compute a mesh where the distance between 
the samples, remember that the samples are the vertices of the triangles, is 
minimum. 

Following is the description of the algorithm that we propose. 

Algorithm 

1. Begin using the Delaunay tetrahedralization computed from the in- 
put cloud of points with the algorithm described in section 2. 

2. Discard all the tetrahedra having a vertex {Vi^i = 1,...,8} that 
belongs to the external cube defined in step 1 of the tetrahedraliza- 
tion algorithm (section 2). The result is a simplicial complex of the 
convex hull of the input data. 

3. Compute the external surface of the complex as the list of facets that 
only belong to one tetrahedron. 

4. Compute the threshold Vu as the mean of the perimeters of the facets 
in the external surface list. Sometimes it will be interesting to mul- 
tiply this value by a factor, as will be explained below. 

5. If a facet in the external surface list exceeds the threshold Vu^ remove 
from the complex the tetrahedron who owns the facet and insert the 
remainder three facets in the surface list. 

6. Repeat step 5 until all the perimeters of the facets in the surface list 
are below Vu- 

7. Recompute the mean perimeter of the external surface. If the differ- 
ence with respect the old value is up 1%, return to step 4. 

The most time consuming step of this algorithm is, by far, the calculus of the 
Delaunay tetrahedralization in step 1. The main disadvantage of the method 
is that it employs a global threshold for all the surface, so if the input points 
are not registered in a uniform way and there is too much variability in the 
perimeter measures along the surface, some holes will appear over the areas 
with less density of points. This could lead to the loss of some of the points from 



276 Angel Rodriguez et al. 



the original input representation. Sometimes, instead of computing the surface 
in a fully automatic way, it is possible to achieve better results if we multiply 
the mean perimeter value by a factor between 1.0 and 2.0 in the step 4 of the 
algorithm. 

The complexity of the algorithm from steps 2 to 7 is O(n^), where n is the 
number of tetrahedra of the tetrahedralization. Each iteration treats n tetrahe- 
dra, and in the worst case, we need to loop n — 1 times. 

4 Experimental Results 

The main purpose of this section is to present some representative objects used 
in the experiments and the results achieved with the algorithm proposed. The 
meshes shown in this section have been generated on a Silicon Graphics Ori- 
gin2000. The main hardware features of this multiprocessor are 2 GB of RAM 
and eight 250 MHz RIOOOO MIPS processors, although our actual implementa- 
tion computes only over one of the eight available processors. The application is 
implemented in G-1-+ and has been ported successfully to several UNIX based 
platforms (Linux). Although the development has been performed on a propri- 
etary computer, we have used free development distribution tools to assure the 
portability of the code. The development tools have been those that provide 
GNU [16], and the 3D visualization of the objects has been carried out with 
Geomview [14]. 

The clouds of points presented with more detail in this paper belong to two 
synthetic objects, the goblet and the sea shell, proceeding from the GTS Li- 
brary [24]. GTS is an Open Source Free Software Library intended to provide a 
set of useful functions to deal with 3D surfaces meshed with interconnected tri- 
angles. A third object with higher genus than the previous has been considered 
too with a similar detail: a human skull extracted from 3D tomographic real 
data. A fourth object proceeding from the Large Geometric Models Archive of 
the Georgia Institute of Technology is presented: a human skeleton hand [15]. In 
this case, the number of available samples is very higher, but the hand presents 
areas where the density of points is very variable too. The reason to show these 
examples is that the fourth objects present very different geometric features that 
allow to illustrate how the algorithm works. The selection of the first two syn- 
thetic shapes is motivated by the fact that the available sensors can not capture 
the internal geometry of the objects, like the details recovered inside the sea 
shell. Table 1 summarizes some statistics of the examples shown, together with 
the data of the other meshes extracted from some of the real objects used in the 
tests. It includes the number of points of the clouds describing the geometry of 
the objects, the number of vertices and facets of the final meshes and the exe- 
cution time of each one of the most demanding stages of the whole process: the 
3D tetrahedral Delaunay computation, and the final triangular facet extraction. 
Although there is some loss in the number of vertices with respect the number 
of input samples, the examples show that the shape, globally and locally, is cor- 
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Table 1. Statistic data about some of the objects managed in the experiments 



Object 


No. of 
input 
points 


No. of 
output 
vertices 


No. of 
output 
facets 


Exec, time(s): 
3D Delaunay 
Tetrahedra 


Exec, time(s): 
Triang. mesh 
Extraction 


Total 

Execution 

Time(s) 


Bowl 


1093 


685 


1412 


6.800 


0.988 


8.00 


Cup 


3051 


2791 


5630 


14.773 


1.087 


16.391 


Amphora 


6287 


6041 


16880 


45.757 


4.781 


51.48 


Human skull 


6391 


5707 


14158 


63.220 


3.570 


68.172 


Tip of an arrow 


7609 


5405 


12342 


85.305 


5.838 


92.561 


Sea shell 


7782 


5379 


10906 


38.298 


2.728 


42.259 


Hand 


52704 


47342 


144166 


1090.171 


41.158 


1132.884 




(a) Input data of the 
goblet. 




(b) Erosion with the 
convex hull criterion. 




(c) First view of the fi- 
nal mesh. 




(d) Second view of the 
final mesh. 



(e) Some detail of the 
base of the goblet. 



(f) Slice view of the fi- 
nal mesh. 



Fig. 1. The goblet. No. of points: 3051. No. of facets: 5630 



rectly recovered. The most of the points removed are discarded in the Delaunay 
tetrahedralization step due to numerical errors. 

The first object, Fig. 1, is a goblet that proceeds from [24]. The surface 
description of the goblet is a set of 3051 3D points. Figure 1(a) shows the input 
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(a) Input data. 



(b) First view of the fi- (c) Second view of the 
nal mesh. final mesh. 




(d) Some detail of the 
output mesh. 



(e) Interior view of the 
final mesh. 



(f) Some detail in the 
inside of the output 
mesh. 



Fig. 2. The sea shell. No. of points: 7782. No. of facets: 10906 



cloud of points that describes the geometry of the goblet. The mesh obtained 
after the first stage of the algorithm is shown on figure 1(b). This is the stage 
where the algorithm makes an erosion of the tetrahedra whose circumscribed 
spheres have centers outside of the convex hull. Figures l(c)-l(f) show several 
views of the final mesh. Figure 1(c) overlaps the cloud of points with the output 
mesh, like Fig. 1(e), but in this one, we have performed a zoom over the base 
of the goblet to take a closer look at the extracted mesh and the input data. 
The black dots represent the 3D input points. The extracted mesh is a Delaunay 
triangulation of a subset of the input data. As it is shown in this and in the next 
figures, it can be considered that the corners, holes and cavities are satisfactory 
recovered. 

The second object. Fig. 2, is a synthetic sea shell taken from the GTS Li- 
brary [24]. Fig. 2(a) shows the input data; Fig. 2(b) shows the mesh achieved 
after applying the algorithm described in section 3 with the superposition of the 
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(a) Volumetric data of (b) Cloud of points 
the human skull. extracted from the 

volumetric data of 
Fig. 3(a). 



(c) First view of the 
human skull. 




(d) Second view of the (e) A closer view 
human skull. showing some detail 

of the human skull. 



Fig. 3. The human skull. No. of points: 6391. No. of facets: 14158 



input cloud of points; Fig. 2(c) presents another view of the final mesh; we have 
applied a zoom process over the shell’s opening in Fig. 2(d) for a better view of 
the details of the sea shell hole; in Fig. 2(e) we see a cut of the final mesh in 
order to see the inside and the cavities of the output mesh; finally, in Fig. 2(f), 
we present a zoom over the half-bottom interior of the sea shell. 

The third object is a human skull (Figure 3). The main problems of this shape 
are the high complexity and the variable density of the cloud of points, presenting 
zones where there are very few points, for example in the eye holes, and others 
where the distribution is more uniform. Like in the previous figures, we present 
the input data. Fig. 3(b) and several views of the final mesh. Fig. 3(c)-3(e). 

Figure 4 shows the surface extracted from a human hand. It presents some 
difficulties, like the inter-finger space or the finger tips, but in all the cases the 
surface recovered can be considered very close to the ideal. 
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(a) A photograph 
taken from the orig- 
inal human skeleton 
hand. 



(b) Cloud of points 
extracted from the 
human skeleton hand. 



(c) Erosion with the 
convex hull criterion. 




(d) First view of the 
human hand. 



(e) Second view of the 
human hand. 



Fig. 4. The human hand. No. of points: 47342. No. of facets: 144166 



Last, in Fig. 5 we present the shape of the other objects mentioned in Table 1. 
In this case, the digitization of the bowl and the tip of an arrow has been 
carried out with a hand-held 3D digitizer. This device is a tactile low-price 
sensor manipulated by a human operator, which allows the simulation of any 
other kind of sensor that provides scarce and irregularly sampled measurements. 
It provides exclusively geometric information (only the points’ coordinates are 
kept). The amphora has been digitized with a laser-based sensor, which provides 
a more regular sampling than the hand-held digitizer, but its prize is considerably 
higher too. 

The current version of the implemented algorithm allows the specification 
of a threshold defined by the user. Sometimes, the geometry of the object does 
not allow to reach satisfactory results in a fully automatic way and the operator 
has the possibility to decide his preference: reducing the value of the threshold 
achieves a finer resolution mesh, but may produce some undesirable holes on the 
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(a) Input data from a 
bowl. 



(b) Surface extracted 
from Fig. 5(a). 



(c) Input data from 
the amphora. 




(d) Surface extracted 
from Fig. 5(c). 



(e) Input data from 
the tip of an arrow. 



(f) Surface extracted 
from Fig. 5(e). 



Fig. 5. Objects digitized with a laser-based sensor (amphora) and with the hand- 
held digitizer (bowl and tip of an arrow) and meshes obtained 



final mesh. Usually, this defect is not present in the meshes generated with a 
more uniform distribution, but it may be noticeable if the density of points over 
the object’s surface is very variable. 

We must notice that all the examples shown have been refined by the user 
specifying a threshold very similar to that computed in the fully automatic 
process. 

5 Conclusions and Future Work 

It has been presented a 0{n^) algorithm that allows the automatic extraction of 
3D Delaunay triangulations from free genus real objects. The only input informa- 
tion about the geometry of the objects is a set of 3D points irregularly distributed 
over the whole objects’ surfaces. We must remark that n is the number of tetra- 
hedra achieved by the spatial partition algorithm stated by Faugueras [12], and n 
depends not only on the number of considered points, but also on their positions. 
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The use of a topological criteria is not enough to obtain a mesh that cor- 
rectly fits the shape of the objects without a human interaction, and it has been 
necessary to include additional restrictions to the process in order to become 
determinist. In our case, the heuristic chosen has been the perimeter of the tri- 
angular facets, mainly, because of its simplicity and good results achieved over 
other measures. The algorithm has been tested successfully with numerous free 
shape real objects, as we have shown with some examples presented above. 

Thinking in future works, it should be desirable an optimization of the algo- 
rithm to reduce the response time by means of a parallel implementation, taking 
advantage of the local nature of the operations [7]. 
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Abstract. In many application areas of imaging sciences, object 
information captured in multi-dimensional images needs to be 
extracted, visualized, manipulated, and analyzed. These four groups of 
operations have been (and are being) intensively investigated, 
developed, and applied in a variety of applications. In this paper, after 
giving a brief overview of the four groups of operations, we put forth 
two main arguments: (1) Computers are digital, and most image 
acquisition and communication efforts at present are toward digital 
approaches. In the same vein, there are considerable advantages to 
taking an inherently digital approach to the above four groups of 
operations rather than using concepts based on continuous 
approximations. (2) Considering the fact that images are inherently 
fuzzy, to handle uncertainties and heterogeneity of object properties 
realistically, approaches based on fuzzy sets should be taken to the 
above four groups of operations. We give two examples in support of 
these arguments. 



1 Introduction 

In imaging sciences, particularly medical, there are many sources of multidimensional 
images [1]. 

2D: digital/digitized radiographic images, tomographic slices. 

3D: a time sequence of 2D images of a dynamic object, a stack of slice images of a 
static object. 

4D: a time sequence of 2D images of a dynamic object for a range of imaging 
parametric values (e.g., MR spectroscopic images of a heart), a time sequence 
of 3D images of a dynamic object. 

5D: a time sequence of 3D images of a dynamic object for a range of parametric 
values. 

Three- and higher-dimensional images may also be generated by a computational 
step. For example, a 5D binary image is produced if we apply a "lifting" operation [2] 
to a 4D gray-level image to represent the 4D image intensity distribution as a surface 
in a 5D space. Columns of volume elements in this higher dimensional space 
represent the height of the surface. 
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For ease of further reference, we will refer to a multidimensional image simply as a 
scene and represent it by a pair C = (C, j), where C, the scene domain, is a 
multidimensional rectangular array of spatial elements {spels for short), and / the 
scene intensity, is a function that assigns to every spel a vector whose component 
elements are from a set of integers. 

There is usually an object of study for which the scene is generated. This may be a 
physical object such as an organ/tissue component, a tumor/fracture/lesion/an 
abnormality, a prosthetic device, or a phantom. It may also be a conceptual object 
such as an isodose surface in a radiation treatment plan, an activity region highlighted 
by PET or functional MRI, or a mathematical phantom. The objects may be rigid, 
deformable, static, or dynamic. For example, the bones at a moving joint represent a 
dynamic, rigid object assembly. The heart muscle and the blood pool in the various 
chambers constitute a dynamic, deformable object assembly. We will refer to any 
operation, which, for a given set of multimodality scenes of an object of study, 
produces information about the object of study, as 3D imaging. The information may 
be qualitative - for example, a 3D rendition of a static object, an animation sequence 
of a dynamic object or an animation sequence of a static object corresponding to 
different viewpoints. It may be quantitative - for example, the volume of a static 
object, and the rate of change of volume of a dynamic, deformable object, the extent 
of motion of a rigid dynamic object. The purpose of 3D imaging is, therefore, given 
multiple scenes as input, to output qualitative/quantitative information about the 
object of study. 

In this exposition, we will give a quick overview of the various 3D imaging 
operations that are commonly used and identify the challenges currently faced. We 
argue with strong evidence that, since the scenes are inherently digital, we should take 
entirely digital approaches to realize all 3D imaging operations. We also suggest that, 
since object information in scenes is always fuzzy, we should take approaches that are 
based on fuzzy set concepts. The need to fulfill these strategies opens numerous 
problems that are inherently digital in both hard and fuzzy settings. The reader is 
referred to [2] for a detailed account of 3D imaging operations and their medical 
applications. 

Although the examples considered in this exposition are all medical, the principles 
and the arguments are also applicable to other application areas. 



2 An Overview of 3D Imaging Operations 

3D imaging operations may be classified into four groups: preprocessing, 
visualization, manipulation, and analysis. All preprocessing operations aim at 
improving or extracting object information from the given scenes. The purpose of 
visualization operations is to assist humans in perceiving and comprehending the 
object structure/characteristics/function in two, three and higher dimensions. The 
manipulation operations allow humans to interactively alter the object structures 
(mimic surgery, for example). The analysis operations enable us to quantify object 
structural/morphological/functional information. 
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In the rest of this seetion, we will eursorily examine these four elasses of 
operations. The referenees given here are only samples from a literature, whieh has 
over a thousand publieations exeluding pure applieation-direeted papers. 



2.1 Terminology 

Body region: Region of the spaee, for example, of the human body, that is imaged. 

Pixel: Image element in a 2D seene. 

Spel: Image element in a 2D, 3D, 4D, . . ., seene. 

Scalar scene: A seene wherein eaeh spel has one value assigned to it. 

Vector scene: A seene wherein eaeh spel has two or more values assigned to it. For 
example, T2 and proton density value in an MR seene. 

Scene intensity: The value(s) assigned to the spel. 

Scene domain: The set of all spels in a seene. 

Pixel size: The size of the spel within the natural sliee of the seene. 

Slice spacing: The distanee between the centers of two successive slices. In case of 
isotropic resolution, the spels have equal length in all dimensions. 

Scene coordinate system: A coordinate system affixed to the scene. 

Object: An object of study in the body region such as an organ or a pathology such as 
a tumor. 

Object system: A collection of objects. 

Structure: A computer representation of an object. It may be hard (crisp or binary) or 
fuzzy. In the hard case, a spel in the scene domain is considered to be either in 
the object or not in the object. In the fuzzy case, each spel has a value between 0 
and 100 that indicates the degree of objectness assigned to the spel. 

Structure system: A collection of structures. 

Structure coordinate system: A coordinate system affixed to the structure/structure 
system. 

Rendition: A 2D scene created by the computer that depicts some aspects of the 
object information contained in a scene. For example, the display of a slice of the 
scene, a shaded-surface display of a structure computed from the scene. 

Display coordinate system: A coordinate system affixed to the display device. 

Imaging device coordinate system: A coordinate system affixed to the imaging 
device. Information about the various coordinate systems is useful in registering 
scenes and structures. 



2.2 Preprocessing Operations 

The input to these operations is a set of scenes of a body region and the output is 
either a set of scenes or a structure system. Typically, the structure system contains 
only one structure corresponding to the object of interest. 

Volume of Interest (VOI) [3]: 

The input scene Q = (C/,yj) is converted to another smaller scene Q = {Co,fo) such 
that Co Cl Ci and fo is sl restriction of fi to C^. is specified, usually interactively, to 
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be the set of spels in a reetangular box, a sphere, or an ellipsoid (and their equivalents 
in higher dimensions). The purpose here is to make Co just eontain the objeets of 
interest and to reduee the amount of data to be handled in other expensive operations. 
In large, routine applieations, to automate this operation is a ehallenge. For this, 
elearly, some object knowledge is essential. 

Filtering: 

In these operations, the output scene Q is obtained from the input scene C, by 
modifying the intensity values in such a way as to either enhance object information 
[4] or to suppress unwanted non-object information such as noise [5] in C^. Usually 
for these operations = C/. Various types of filters are available under these two 
categories. Unfortunately, usually some non-object information is also enhanced by 
the former methods and object information is also suppressed by the latter methods. 
The challenge in filtering is how to incorporate object knowledge into the filter design 
so as to achieve optimum performance. 

Interpolation: 

The purpose of interpolation is to change the level and orientation of discretization 
of Ci to a desired entity. For these operations, usually ^ C^. These operations are 
needed for converting a non-isotropically sampled scene into a scene with isotropic 
resolution, or for obtaining scenes at desired level and orientation of discretization. 
Two classes of operations are available: scene-based and object-based. \n scene-based 
methods [C\,fo(c) is estimated from f(c) for spels c' lying in the close vicinity of c. In 
object-based methods [7], some object information is used in guiding interpolation. 
Object-based methods have been shown repeatedly in the literature to produce better 
results than scene-based methods. The challenge here is how to incorporate object 
knowledge specific to the application into the interpolation process. 

Registration: 

The purpose of registration is to represent structures obtained from multiple scenes 
in a common coordinate system. These are needed for combining object information 
obtained from multiple modalities (such as MRI, PET and CT) for the same body 
region, and for determining change, growth, motion, and displacement of objects over 
time. Again two classes of methods are available: scene -based and object-based. In 
scene-based methods [8], scene intensity patterns are matched, while in object-based 
methods [9] structures derived from scenes are matched. Both rigid and deformable 
methods of registration under both categories are available. As in other operations, the 
challenge here is to incorporate specific object knowledge into the registration process 
in both object-based and scene-based methods. 

Segmentation: 

This operation, the most crucial among all 3D imaging operations, outputs a 
structure system from a given set of scenes. It consists of two related tasks - 
recognition and delineation. Recognition is the process of determining roughly the 
whereabouts of the objects in the scene, and delineation is the process of determining 
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precisely the spatial extent and composition of the objects in the scene. 
Knowledgeable humans usually outperform computer algorithms in the high-level 
task of recognition, while computer algorithms can outperform humans in the precise, 
accurate, and efficient delineation of objects. 

There are two classes of approaches to recognition - automatic (knowledge- 
based/model-based) [10] and human-assisted. By far the latter is the most commonly 
used in practice. At the outset, two classes of approaches to delineation may be 
identified - boundary-based [11] and region-based [12]. In the former, the output is a 
set of boundaries of the object, and in the latter, it is a set of object regions. Each of 
these two strategies can be further divided into subgroups - hard [11] and fuzzy [12] - 
depending on whether the output is a hard or a fuzzy set. Combined with the two 
strategies of recognition, therefore, we may identify eight classes of approaches to 
segmentation. 

As seen from the outline of other preprocessing operations, object knowledge 
usually facilitates all 3D imaging operations, including segmentation. This implies 
that segmentation is helpful/needed for all 3D imaging operations, ironically also for 
segmentation itself To devise generic segmentation methods with high precision, 
accuracy, and efficiency, that can be quickly adapted to a given application, is indeed 
the greatest challenge in 3D imaging and in image analysis. 



2.3 Visualization 

The input to these operations is a set of scenes or a structure system and the output is 
a rendition. 

Two classes of methods are available: scene-based wherein renditions are created 
directly from given scenes, and object-based, wherein renditions are created from 
(hard or fiizzy) structures extracted from the scene. Scene-based methods may be 
further divided into two groups: slice mode and volume mode. In the former, 2D 
” slices" of different orientations - natural, orthogonal, oblique and curved - are 
extracted from the scene and displayed as a montage or in a roam through fashion 
using gray scale, color or overlay [2]. In the latter, surfaces, interfaces and intensity 
distributions are displayed with a variety of 3D cues using surface rendering and 
volume rendering techniques [13, 14]. In object-based methods, the hard or fuzzy 
structures extracted from the scene are displayed with 3D cues using surface 
rendering and volume rendering techniques [15, 16]. The challenges in visualization 
are the realistic display of objects including color, texture and surface properties, 
speeding up volume rendering, and objective evaluation of the large number of 
rendering schemes that are available. 



2.4 Manipulation 

These operations are needed for editing structures (for simulating surgery in medical 
applications) for unobscured visualization, and for developing aids for interventional 
procedures. They take as input a structure system and output another structure system 
by altering structures or their relationship. Two classes of operations are being 
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developed: rigid [15, 17] with operations to eut, separate, add, subtraet, move and 
mirror structures and their components, and deformable [18]. The main challenge in 
manipulation is to realize the manipulative operations realistically utilizing object 
material properties. 



2.5 Analysis 

The purpose of these operations is, given a set of scenes or a structure system, to 
generate a quantitative description of the morphology or function of the objects in the 
body region captured in the scene. 

Two classes of operations are available: scene-based, wherein quantities based on 
scene intensities such as intensity statistics within object regions, tissue density, 
activity, perfusion, flow are estimated; and object-based, wherein quantities measured 
from segmented structures and how they change with time are estimated including 
distance, length, width, curvature, area, volume, kinematics and mechanics. The main 
challenge here is the assessment of the accuracy of the estimated measures. 



3 Go Digital, Go Fuzzy 

Any scene of any body region exhibits the two following important characteristics of 
the objects contained in the body region. 

Graded Composition: The spels in the same object region exhibit heterogeneity of 
scene intensity due to the heterogeneity of the object material, and noise, blurring, and 
background variation introduced by the imaging device. Even if a perfectly 
homogeneous object were to be imaged, its scene will exhibit graded composition. 

Hanging-togetherness (Gestalt): In spite of the graded composition, knowledgeable 
humans do not have any difficulty in perceiving object regions as a whole (Gestalt). 
This is a fuzzy phenomenon and should be captured through a proper theoretical and 
computational framework. 

There are no binary objects or acquired binary scenes. Measured data always have 
uncertainties. Additionally, scenes are inherently digital. As seen from the previous 
section, no matter what 3D imaging operation we consider, we cannot ignore these 
two fundamental facts - fuzziness in data and their digital nature. 

We have to deal with essentially two types of data - scenes and structures. We 
argue that, instead of imposing some sort of a continuous model on the scene or the 
structure in a hard fashion, taking an inherently digital approach, preferably in a fuzzy 
setting, for all 3D imaging operations can lead to effective, efficient and practically 
viable methods. Taking such an approach would require, in almost all cases, the 
development of the necessary mathematical theories and algorithms from scratch. We 
need appropriate theories and algorithms for topology, geometry and morphology, all 
in a fuzzy, digital setting, for dealing with the concept of a "structure” in scenes. Note 
that almost all the challenges we raised in the previous section relate to some form of 
object definition in scenes. Therefore, they all need the above mathematical and 
algorithmic developments. Additionally, since we deal with deformable objects (all 
soft-tissue organs, and often even bone, as in distraction osteogenesis - the process of 
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enlarging or compressing bones through load applied over a long period of time), we 
also need fuzzy digital mechanics theories and algorithms to handle structure data 
realistically for the operations relating to registration, segmentation, manipulation and 
analysis. Because of the digital setting, almost all challenges raised previously lead to 
discrete problems. Since we are dealing with ^-dimensional scenes, the mathematics 
and algorithms need to be developed for hard and fuzzy sets of spels defined in n- 
dimensions. 

In the rest of this section, we shall give two examples of digital/fuzzy approaches 
that motivated us to the above argument. The first relates to modeling an object as a 
3D surface and visualizing it. The second relates to the fuzzy topological concept of 
connectedness and its use in segmentation. 



3.1 Polygonal versus Digital Surfaces 

In hard, boundary-based 3D scene segmentation (e.g., thresholding approaches), the 
output structure is often a 3D surface. The surface is represented usually either as a 
set of polygons (most commonly triangles [19]), or in a digital form [20] as a set of 
cubes or as a set of oriented faces of cubes. We shall describe in some detail how both 
the representation and rendering of such surfaces is vastly simpler and more efficient 
using digital approaches than using polygonal or other continuous approximations for 
them. 

Let us consider the representation issues first. We shall subsequently consider the 
rendering issues. It is very reasonable to expect these surfaces to satisfy the following 
three properties since surfaces of real objects possess these properties. 

(1) The surface is connected. 

(2) The surface is oriented. This means that it has a well defined inside and 
outside. 

(3) The surface is closed; that is, it constitutes a Jordan boundary. The latter 
implies that the surface partitions the 3D space into an “interior” set and an 
“exterior set such that any path leading from a point in the interior to a point 
in the exterior meets the surface. 

The definitions of connectedness, orientedness, and Jordan property are much 
simpler, more natural and elegant in the digital setting using faces of cubes (or cubes) 
than using any continuous approximations such as representations via triangles. These 
global concepts can be arrived at using simple local concepts for digital surfaces [21, 
22]. For example, orientedness can be defined by thinking of the faces to be oriented. 
That is, a face with a normal vector pointing from inside of the surface to its outside 
in the -x direction is distinguished from a face at the same location with a face normal 
in exactly the opposite direction. Usually the surface normals at various points p on 
these surfaces (in both the polygonal and the digital case) are estimated independent 
of the geometry of the surface elements, for example, by the gradient at p of the 
intensity function / of the original scene from which the surface was segmented [23]. 
The gradient may also be estimated from other derived scenes such as a Gaussian 
smoothed version of the segmented binary scene [24]. Since the surface normals 
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dictate the shading in the rendering proeess, the surfaee elements are used here only 
as a geometrie guide rather than as detailed shape deseriptors of the surfaee. 
Therefore the digital nature of the geometry introdueing “stairease” effeets in 
renditions ean be more or less completely eliminated. Sinee the surfaee elements in 
the digital ease are simple and all of identieal size and shape, they ean be stored using 
elever data struetures that are typieally an order of magnitude more compaet than their 
polygonal eounterparts [25]. Finally, there is a well-developed body of literature (e.g., 
[26], Chapters 6, 7) deseribing the theory of digital surfaees that naturally generalizes 
to f7-dimensions for any n. Sueh a generalization is very diffieult for polygonal 
representations. 

Let us now eome to the rendering issues. Rendering of digital surfaees is 
eonsiderably simpler and more effieient than that of polygonal surfaees, the main 
reason being the simplieity of the surface elements and of their spatial arrangement in 
the former case [15, 27]. There are mainly two eomputational steps in any rendering 
algorithm: hidden surface removal and shading. Both these steps can be considerably 
simplified exploiting the speeial geometry of the surface elements, reducing most 
expansive computations to table lookup operations. For example, the faces in a digital 
surface can be classified into six groups based on their face normals (eorresponding to 
the directions (+x, +y, +z, -x, -y, -z). For any viewpoint, all faces from at least three of 
these groups are not visible and hence ean be disearded without doing any 
computation per face [28]. There are other properties that allow the rapid projection 
of discrete surfaces in a baek-to-front or front-to-back order by simply aceessing the 
surface elements in some combination of column, row, and slice order from a 
rectangular array. Triangular and other continuous approximations to surfaces simply 
do not possess such computationally attractive properties. It was shown in [25] based 
on 10 objects of various sizes and shapes that, digital surface rendering entirely in 
software on a 300 MHz Pentium PC can be done about 4-17 times faster than the 
rendering of the same objects, whose surfaces are represented by triangles, on a 
Silicon Graphics Reality II hardware rendering engine for about the same quality of 
renditions. Note that the Reality II workstation is vastly more expensive than the 
Pentium PC. 

The design of rendering engines such as Reality II was motivated by the need to 
visualize, manipulate, and analyze structure systems representing human-made 
objeets such as automobiles and aircrafts. Early efforts in modeling in computer 
graphics took, for whatever reason, a continuous approach. Digital approaches to 
modeling have originated mostly in medical imaging [29] and have recently been 
applied also to the modeling of human-made objects [30] with equal effectiveness. 
Digital approaches are more appropriate for modeling natural objects (such as internal 
human organs) than continuous approaches. Natural objeets tend to be more eomplex 
in shape and morphology than human-made objects. The applications of digital 
approaches to human-made objects [30] have demonstrated that the latter are equally 
appropriate even for human-made objects. 
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3.2 Fuzzy Connected Object Definition 

Although much work has been done in digital geometry and topology based on hard 
sets, analogous work based on fuzzy sets is rare [12, 31]. As pointed out earlier, the 
uncertainties about objects inherent in scenes must be retained as realistieally as 
possible in all operations instead of making arbitrary hard deeisions. Although many 
fiizzy strategies have been proposed partieularly for image segmentation, none of 
them has eonsidered the spatio-topologieal relationship among spels in a fuzzy setting 
to formulate the eoneept of hanging-togetherness. (We note that the notion of hard 
eonneetedness has been used extensively in the literature [31], Chapter 1. But hard 
eonneetedness already assumes segmentation and removes the flexibility of utilizing 
the strength of eonneetedness in segmentation itself) This is a vital pieee of 
information that ean greatly improve the immunity of objeet definition (segmentation) 
methods to noise, blurring and background variation. 

Given the fuzzy nature of objeet information in scenes, the frameworks to handle 
fuzziness in seenes should address questions of the following form: How are objeets 
to be mathematically defined in a fuzzy, digital setting taking into account the graded 
composition and hanging-togetherness of spels? How are fuzzy boundaries to be 
defined satisfying a Jordan boundary property? What are the appropriate algorithms to 
extract these entities from scenes in such a way as to satisfy the definitions? These 
questions are largely open. We will give one example below of the type of approaches 
that can be taken. This relates to fuzzy eonnected objeet definition [12]. This 
framework and the algorithms have now been applied extensively on lOOO’s of scenes 
in several routine clinical applications [32-34] attesting to their strength, practical 
viability, and effectiveness. 

We define a fuzzy adjacency relation a on spels independent of the scene 
intensities. The strength of this relation is in [0, 1] and is greater when the spels are 
spatially closer. The purpose of oc is to capture the blurring property of the imaging 
device. 

We define another fuzzy relation k, called affinity, on spels. The strength of this 
relation between any two spels c and d lies in [0, 1] and depends on a as well as on 
how similar are the scene intensities and other properties derived from scene 
intensities in the vicinity of c and d. Affinity is a local fuzzy relation. If c and d are far 
apart, their affinity is 0. 

Fuzzy connectedness is yet another fuzzy relation on spels, defined as follows. For 
any two spels c and d in the scene, consider all possible connecting paths between 
them. (A path is simply a sequence of spels such that the successive spels in the 
sequence are ’’nearby”.) Every such path has a strength which is the smallest affinity 
of successive pairwise spels along the path. The strength of fiizzy connectedness 
between c and d is the largest of the strength of all paths between c and d. 

A fuzzy connected object of a certain strength 0 is a pool O of spels together with 
an objectness value assigned to each spel in O. O is such that for any spels c and d in 
O, their strength of connectedness is at least 0, and for any spels ^ in O and g not in 
O, their strength is less than 0. 

Although the computation of a fuzzy connected object in a given scene for a given 
K and 0 appears to be combinatorially explosive, the theory leads to solutions for this 
problem based on dynamic programming. In fact, fuzzy connected objects in 3D 
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scenes (256x256x60) can be extracted at interactive speeds (a few seeonds) on 
modern PCs such as a 400 MHz Pentium PC. 



4 Concluding Remarks 

In this article, we have first given an overview of the operations available for 3D 
imaging - a discipline wherein, given a set of multidimensional seenes, the aim is to 
extract, visualize, manipulate, and analyze object information captured in the seenes. 
We have also raised numerous challenges that are encountered in real applieations. 
Computers are digital. Current attempts in image aequisition, storage, and 
eommunication are completely digital or are proeeeding in that direction. We have 
presented an argument with evidenees that there are considerable advantages in taking 
an inherently digital approaeh to realizing all 3D imaging operations. We have also 
argued that, sinee object information in scenes is fuzzy, the digital approaches should 
be developed in a fuzzy framework to handle the uneertainties realistically. This calls 
for the development of topology, geometry, morphology and mechanies, all in a fuzzy 
and digital setting, all of which are likely to have a significant impact on imaging 
sciences such as medical imaging and their applications. 
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Abstract. A digital naive plane may be seen as a repetition of (n, m)- 
cubes, set composed of n x m adjacent voxels or more generally sets of p 
voxels. In a previous works [VG99a], we have shown how to link the pa- 
rameters of a naive plane to the different configurations of voxels sets by 
the construction of the associated Earey net. We propose an algorithm to 
recognize any set of coplanar voxels. This algorithm will be used for the 
polyhedrization of voxel objects. This is an original contribution offering 
a new method for digital plane recognition. 

Keywords: Digital naive plane - Polyhedrization - Recognition - Equiv- 
alence classes 



1 Introduction 

The characterization of digital naive planes is now solved. Effectively, naive 
planes have been studied through their configurations of tricubes [Sch97,VC97], 
of (n, m)-cubes [VC99b] and connected or not connected voxels set 
[VC99a,Ger99]. The link between the normal equation of a plane and config- 
uration of voxels set has been studied by the construction of the corresponding 
Earey net [VC99a]. 

We can find many references about the recognition of digital planes. Some al- 
gorithms were related to the construction of the convex hull of the studied voxels 
set [KS91,KR82]. Other approaches use linear programming [ST91], mean square 
approximation [BF94] or Fourier-Motzkin transform [FP99,FST96,Vee94]. 

The first algorithms entirely discrete were to recognize rectangular pieces of 
naive planes [Deb95,DRR94,VC99b]. In this paper, we propose an incremental 
algorithm to recognize any coplanar voxels set. 

2 Definitions 

Let a, 6, c, r be four integers such as a, 6, c are not null all together and verify 
gcd(a, 6, c) = 1. 
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The digital naive plane P(a, 6 , c, r), where (a, 6 , c) is its normal vector and r 
its translation parameter, is the set of points (x, y, z) in 7L^ verifying: 

{) < ax by cz r < max(|a|, | 6 |, |c|) 



We will limit our study to naive planes V{a, 6 , c, r) in the 48th part of space such 
as0<a<6<c and c 7 ^ 0. These planes are functional in {)xy. For each point 
(x, y) in 7L^ ^ we have only one point (x, y, z) in belonging to the naive plane. 
Let us notice /(a, 6 , c, r) the function from to 7L defined by: 



/(a, 6 , c, r)(u, v) 



au bv ^ r 
c 



where [w] denotes the integer part of the real number re, then z = f(a, 6 , c, r)(x, y). 

The points (x, y, z) of the naive plane P(a, 6 , c, r) which verify ax + 6 ^ + cz + 
r = 0 (resp. ax + 6 ^ + cz + r = c — 1 ) are the lower (resp. upper) leaning 
points of the naive plane. 



3 Equivalence Class of a Voxels Set 

Let n in IN* and V = {(n, ji), • • • , (in^jn)} a set of n points of 7L^ . 

The cluster of voxels 5(a, 6, c, r)(x, y) of the naive plane P(a, 6, c, r) indexed 
by R and with origin (x, is defined by: 

5(a,6,c,r)(x,^) 

n 

\^{{x + iq,y + jq,z + kq) \ {iq, jq) ^ V, kq = f {a,b, c,r){x + iq,y + jq) - z} 
9=1 

where z = /(a, 6, c, r)(x, ^). 

S{a,b,c,r){x,y) is the part of the naive plane going through the n voxels 

{x + iq,y+jq,z + kq), q = l,---,n. 

Let {xi^yi^zi) be a lower leaning point from V{a,b^c,r). It verifies axi + 
byi czi r = 0. The point (x, z) can be written as (x/ + a, + x, zi + w) 
with (u,v,w) in As it belongs to the naive plane V{a^b^c^r)^ it verifies 
0 < a(x/ + a) + b{yi + x) + c{zi + x;) + r < c. Let us notice r' equal to au-\-bv^cw. 
For <7 = 1, • • • , n, the point (x+i^, y+jq^ z-\-kq) verifies 0 < aiq-\-bjq-\-ckq-\-r' < c. 
Consequently, 5(a, 6, c, r)(x, y) can be written as: 

5(a, 6, c, r)(x, = {(x, y, z)} 0 5(a, 6, c, r')(0, 0) 

where A (B B is the Minkowski sum between the sets A and B. 

All real planes for which the discretization by the object boundary quantiza- 
tion method on the set {(x, ^)}©R is the set 5(a, 6, c, r)(x, y) have to go through 
the point (x/, yuzi). Moreover on the point (x, y) the discretization must be the 
voxel (x, y^z). A first equivalence class of 5(a, 6, c, r)(x, y) is the set of parame- 
ters (a, (3) with 0<o;</3<lof the real plane a{x — x/) + (3{y — yi) + z — Zi = 0 
verifying {) < au (3v w < 1. We notice this equivalence class by the set: 
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S{r){x,y) = Pi {(a,/?) | 0 < a < /3 < 1, 

<7=1 

{x-\-iq,y-\-jq,z-\-kq) G S {a, b, c, r){x , y) , ~1 < iqa^ jq(3 + kq < 1} 

For each integer point {i^j),we have /(a, 6, c, r)(x/+7, yi~\-j) = /(a, 6, c, r){xi^ yi) 
+/(a, 6, c, 0)(7, j). For q = 1, • • • , tt, the integer kq satisfies kq = /(a, 6, c, 0 )(tx + 
t; + jq) — f{a^ 6, c, 0 )(tx, t;). So 5(r)(x, is equal to 5(0 )(tx, t;). 

Let k) be the line in the parametric space W = {(cr,/3), ^ < ol < (3 < 

1} with equation ia jp k = 0. Let k) be the open-band of this space 

limited by the two parallel lines k 1) and k — 1 ). 

The equivalence class becomes: 

S{r){x,y)=Wf]l(\B{t„j,,k,) 

\q=l 

In a previous work [VC99a], we proved that the voxels set S centered on the 
lower leaning point (xi^yi^zi) and defined by: 

n 

£ = U S{a,b,c,r){xi - ig,yi - jg) 

9=1 

is a complete system. It is representative of the different configurations of voxels 
sets defined on V which generate the naive planes with normal (a^b^c). The 
equivalence class 8 of that set is the intersection of the open bands k) for 

(z, j) belonging toVoV (0 designs the Minkowski difference between two sets) 
and k verifying k = /(a, 6 , c, r)(x/ yi + j) — /(a, 6 , c, r)(xi^ yi). The equivalence 

classes of the different configurations appearing around leaning points split the 
space W in polygonal areas called Farey net associated to voxels sets defined 
on V. 

Example 1. Let A be the voxels set of a naive plane defined on V = {(0,0), 
( 1 ,- 1 ), (2,0), ( 2 , 1 )} and illustrated in figure 1 (a). The equivalence class A of 
that set is the intersection of the four bands 5(0, 0,0), 5(1, —1,0), 5(2,0, —1) 
and 5(2, 1, —2) (cf. Fig. 1(b)). Each rational point in that area corresponds to 
the parameters of a naive plane containing that configuration of voxels set. In 
figure 1(c), we have the Farey net associated to the voxels set defined on V. 

Now if we look for real planes ax -hj3y-l-z-hj = 0 with 0 < a < j3 < 1 
for which the discretization by the object boundary quantization method on 
the point (x, y) is the voxel (x, z) then the parameter 7 has to verify 7 = 
7 ' — {ax -\- (3y -\- z) with 0 < 7 ' < 1. Moreover if the dicretization on {(x, ^)} © E 
is the set 5(a, 6 , c, r)(x, ^), the parameters of that planes belong to the set: 

n 

S'{x, ^) = Pi {(<^, P, y — {ax j3y z)) \ 0 < < 1, 0 < 7 ' < 1, 

<7=1 

{x-\-iq,y + jq,z + kq) G S {a,b , c,r){x , y) , 0 < iqa jq(3 F kq < 1} 

This second equivalence class will be used in the recognition algorithm. 
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Fig. 1. (a) Set A = {(0, 0, 0), (1, — 1, 0), (2, 0, — 1), (2, 1, — 2)}; (b) Equiva- 
lence class of A; (c) Farey net associated to voxels set defined on V = 
{( 0 , 0 ), ( 1 ,- 1 ), ( 2 , 0 ), ( 2 , 1 )} 



4 Recognition Algorithm 



Let S = {(xg, Hq^ Zq)^ <7 = 1, • • • , n} be a set of n voxels. 

We are going to establish an incremental algorithm to identify the parameters 
of the naive planes going through the n points of S. 

The naive planes solutions are the planes V{a, h^c^r — {ax\ + hyi + czi)) for 

/ (2 1) T \ 

which the parameters ( — , - , - ) belong to the set: 



S = {(a,/3,7) G [0,1]^ X [0,1[| Vg G {1, ■•■,«} 0 < iqa + jqP + y + kg < l} 

where {iq^jq^ kq) is dehned as to be the integer points {xq — xi^yq — yi, Zq — zi) 
for q = 1, • • • , n. 

Let Bq in be the set of vectors (a, 6, c, r) such that the projection in 
the plane (c = 1) are the vertices of the convex huh of the space containing 
the parameters {a, / 3 , 7) of real planes for which the dicretization on the point 
{ip^jp) is the point {ip^jp^ kp) for p varying from 1 to q. 

We are going to construct the sets Bq for q = 1 , • • • , n. The following algo- 
rithm gives at step q the set Bq or the empty set if there is no solution. In the 
hrst case, the solution with the minimal periodicity can correspond to a vertex 
of the convex-hull, the median point between the projection of two vertices of 
Bq or the median point of the area limited by the projection of the vectors of Bq. 

As the discretization of ah real planes of the working space goes through the 
origin (here the origin is taken at point (xi, ^1, zi)), the algorithm starts with 



= {( 0 , 0 , 1 , 0 ), ( 0 , 1 , 1 , 0 ), ( 1 , 1 , 1 , 0 ), ( 0 , 0 , 1 , 1 ), ( 0 , 1 , 1 , 1 ), ( 1 , 1 , 1 , 1 )} 



composed by the six vectors (a, 6, c, r) such that the projection in the plane 
(c = 1) are the vertices of the convex- huh limiting the solution space of (a, /1, 7). 
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Algorithm at step q>2: 

We introduce the point {iq,jq, kq). 

Let Lq and the functions from IN^ to Z defined by: 

Lq{a, 6 , c, r) = aiq + hjq + ckq r 
L+ (a, 6 , c, r) = Lq{a, 6 , c^r) — c 

The naive plane of parameters (a^b^c^r) goes through the voxel (iq^jq^kq) if 
and only if 0 < Lq{a^b^r^c) < c. Consequently the vectors (a^b^c^r) of Bq 
verify Lp(a, 6, c, r) > 0 and L+ (a^b^c^r) < 0 for p = 1, • • • , 

Initialization: Bq = ^. 

For all the vectors Vi in Bq-i^ i = 1, • • • ,#{Bq-i) do: 

1. Process 

Step 1 : If Lq(Vi) > 0 and L'^{Vi) < 0 then the projection of Vi is still 
on the convex hull of the domain solution. We insert Vi in Bq. More partic- 
ularly, if Lq(Vi) = 0 (resp. L+(V^) = 0) we can say that the voxel {iq^jq^ kq) 
(resp. {iq,jq, kq — 1)) is a lower leaning point of the naive plane of parame- 
ters Vi. 



Step 2 : If Lq{Vi) < 0 (resp. L+(V^) > 0), we are going to search the point P 
such as Lq{P) = 0 (resp. T+(P) = 0). To do that, we use an algorithm based 
on the notion of median point [Farl6,Gra92]. 

For each vectors Vj , j>i, belonging to Bo-i and verifying 
Lq{Vj) > 0 (resp. Lj{Vj) < 0) do 
Pi = Vi and P2 = Vj 

While L,(Pi) + L,(P2)^0 (resp. L+ {P^) + L+ {P 2 ) ^ 0) do 

if i,(Pi) + i,(P 2 ) and L,(Vi) (resp. L+ (P^) + L+ {P 2 ) and 
L'^iy'i)) have the same sign then 
Pi = Pi P P 2 and P 2 Vj 

else 

Pi = Vi and P2 = Pi P P2 

End While 
End For 

The solution is given by P = Pi P P2. 

The point (iq^jq^kq) (resp. {iq^jq^kq — 1)) is a lower leaning point of the 
naive plane of parameters P. 

We insert the point P in Bq. 

2. Validation of Bq 

(a) For each vector V = (a, 6, c, r) in Bq we verify if the projection in the 
plane (c = 1) is a vertex of the convex hull. If V can be written as a 
combination of vectors of Bq then the projection of V is on the convex 
hull but it is not a vertex. So we suppress that point from the list. 
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(b) If #{Bq) < 2 , there is no solution and we suppress all the vertices from 
the list. 

(c) If = 3, we verify that the points (ajc^hlc) corresponding to the 

vectors (a, b, c, r) of Bq are not alined otherwise we suppress the vertices 
from the list. 

Example 2. We are going to illustrate this algorithm on an example. We want 
to know if the set of voxels in figure 2 belong to a naive plane of the studied 
48th part of space. We start with a first point defined as the origin of the voxels 




Fig. 2. Set of voxels to recognize 



set (cf. Fig 3(a)). The parameters set of naive planes including the origin are 
the rational points (a, /3, 7 ) contained in the domain limited by the projection 
(aj c^hj c^r ! c) of the vectors (a^b^c^r) of Bi. The set Bi is composed by the six 
vectors: 

= {( 0 , 0 , 1 , 0 ), ( 0 , 1 , 1 , 0 ), ( 1 , 1 , 1 , 0 ), ( 0 , 0 , 1 , 1 ), ( 0 , 1 , 1 , 1 ), ( 1 , 1 , 1 , 1 )} 

As it was previously mentioned, it is equivalent to say that parameters {a, (3) be- 
long to the area limited by the points (a/c, b/c) (cf. Fig 3(b)). We introduce the 
second point (1, —1, 0) (cf. Fig 4(a)). Let us compute the value L 2 {V) and LJ (V) 
on the different vectors V from Bi: 



{a,b,c,r) 


L 2 (a, b^c^r) = a — b r 


L 2 {a^b^c^r) = a — b-\-r — c 


( 0 , 0 , 1 , 0 ) 


0 


-1 


( 0 , 1 , 1 , 0 ) 


-1 


-2 


( 1 , 1 , 1 , 0 ) 


0 


-1 


( 0 , 0 , 1 , 1 ) 


1 


0 


( 0 , 1 , 1 , 1 ) 


0 


-1 


( 1 , 1 , 1 , 1 ) 


1 


0 



The vectors (0, 0, 1, 0), (1, 1, 1, 0), (0, 0, 1, 1), (0, 1, 1, 1) and (1, 1, 1, 1) verify the 
property indiced by step 1 of the algorithm. Ve insert these vectors in ^^ 2 - 
As 4 ^ 2 ( 0 , 1, 1,0) < 0, we apply the algorithm presented in step 2. We introduce 
the new vectors (0, 1, 2, 1) and (1, 2, 2, 1). But (0, 1, 2, 1) is the sum of vectors 
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( 0 , 1 , 1 ) 

( 0 / 1 , 1 /!)^^ 



(0/1,0/!)^^ 

( 0 , 1 , 1 ) 



( 1 , 1 , 1 ) 

(1/1,1/1) 



a 



(b) 



Fig. 3. (a) Set {(0,0,0)}; (b) Equivalence class 



(0, 0, 1, 0) and (0, 1, 1, 1). Similarly, the vector (1, 2, 2, 1) is the sum of vectors 
(0, 1, 1, 1) and (1, 1, 1, 0). Consequently, the vectors (0, 1, 2, 1) and (1, 2, 2, 1) are 
not present in IS 2 . 

Finally, we have: 

B 2 = {( 0 , 0 , 1 , 0 ), ( 1 , 1 , 1 , 0 ), ( 0 , 0 , 1 , 1 ), ( 0 , 1 , 1 , 1 ), ( 1 , 1 , 1 , 1 )} 

Every naive planes for which the projection (a/c, 6/c) of the normal (a,b,c) is 
contained in the area limited by the points (o! jd ^ h' jc') with (a', 6', c', r') belong- 
ing to 82 are solutions (cf. Fig 4(b)). We introduce the third point (2, 0, —1) (cf. 
Fig 5(a)). We compute the value Ls{V) and L^{V) on the different vectors V 
of 82 : 



{a,b,c,r) 


Ls(a, 6, c, r) = 2a — c + r 


(a, 6, c, r) = 2a — 2c + r 


(0,0, 1,0) 


-1 


-2 


(1,1, 1,0) 


1 


0 


(0,0, 1,1) 


0 


-1 


(0,1, 1,1) 


0 


-1 


(1,1, 1,1) 


2 


1 



The vectors (1, 1, 1, 0), (0, 0, 1, 1) and (0, 1, 1, 1) verify the property indiced by 
step 1 of the algorithm. We insert these vectors in 83 . As Ls(0, 0,1,0) < 0 
and (1,1,1,1)>0, we apply for these vectors the algorithm presented in step 
2. We make appear the new vectors (1, 1, 2, 0), (1, 1, 3, 1), (2, 2, 3, 2), (1, 1, 2, 2) 
and (1, 2, 2, 2). But we have: (1, 1, 3, 1) = (1, 1, 2, 2) (1, 1, 1, 0) and (2, 2, 3, 2) = 

(1,1, 2, 2) + (1,1, 1,0). These two vectors are not inserted. 

Finally, we have: 

^3 = {(1,1, 1,0), (0,0, 1,1), (0,1, 1,1), (1,1, 2,0), (1,1, 2, 2), (1,2, 2, 2)} 



Recognition of Digital Naive Planes and Polyhedrization 303 




Fig. 4. (a) Set {(0, 0, 0), (1, — 1, 0)}; (b) Equivalence class 



Every naive planes for which the projection (a/c, 6/c) of the normal (a, 6, c) is 
contained in the area limited by the points (a'/c', 6'/c') with (a', 6', c', r') belong- 
ing to are solutions (cf. Eig 5(b)). More particularly, the naive planes with 
normal (1,1,2) or (1, 2, 2) contains the initial configuration of 3 voxels. Einally, 




Fig. 5. (a) Set {(0, 0, 0), (1, —1, 0), (2, 0, —1)}; (b) Equivalence class 



we introduce the last point (2, 1, —2) (cf. Eig 6(a)). We compute the value L/^(y) 
and (E) on the different vectors V of 83: 
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(a, b, c, r) 


L 4 (a, b^c^r) = 2 a + 6 — 2 c + r 


LJ (a, b^c^r) = 2a + 6 — 3c + r 


( 1 , 1 , 1 , 0 ) 


1 


0 


( 0 , 0 , 1 , 1 ) 


-1 


-2 


( 0 , 1 , 1 , 1 ) 


0 


-1 


( 1 , 1 , 2 , 0 ) 


-1 


-3 


( 1 , 1 , 2 , 2 ) 


1 


-1 


( 1 , 2 , 2 , 2 ) 


2 


0 



The vectors (1, 1, 1, 0), (0, 1, 1, 1), (1, 1, 2, 2) and (1, 2, 2, 2) verify the prop- 
erty indiced by step 1 of the algorithm. We insert these vectors in As 
the value L 4 is negative for the vectors ( 0 , 0 , 1 , 1 ) and ( 1 , 1 , 2 , 0 ), we applied 
for these vectors the algorithm presented in step 2 . We make appear the new 
vectors (1, 1, 2 , 1 ), ( 1 , 1 , 3, 3), ( 2 , 2 , 3, 0), (1, 2 , 4, 4) and (3, 4, 6 , 2 ). But we have: 
(1,2, 4, 4) = (1, 1,3,3) +(0,1, 1,1) and (3, 4, 6,2) = (1,1, 2,1) + (0, 1, 1, 1) + 
(2, 2, 3, 0). These two vectors are not inserted. 

Finally, we have: 

^4 = {(1, 1,1,0), (0,1, 1,1), (1,1, 2, 2), (1,2, 2, 2), (1,1, 2,1), (1,1, 3, 3), (2, 2,3,0)} 

Every naive planes for which the projection (a/c, 6 /c) of the normal (a, 5, c) 
is contained in the area limited by the points (a'/c', 6 '/c') with (a', 6 ', c', r') 
belonging to B^ are solutions (cf. Fig 6 (b)). More particularly, the naive planes 
with normal (1, 1 , 2 ) or ( 1 , 2 , 2 ) or (2,2,3) contains the set of 4 voxels. We can 
verify in figure 6 (c) that this configuration of voxels set is contained in the naive 
plane with normal ( 1 , 1 , 2 ). 




Fig. 6. (a) Set {( 0 , 0 , 0 ), ( 1 , — 1 , 0 ), ( 2 , 0 , — 1 ), ( 2 , 1 , — 2 )}; (b) Equivalence class; 
(c) Part of the naive plane with normal ( 1 , 1 , 2)1 
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5 Polyhedrization of a Voxels Object 

The polyhedrization of a voxelized object has been studied using mainly approx- 
imation approaches [BF94] . Here we propose an algorithm which fully works on 
the discrete representation of digital naive planes. The polyhedrization can be 
made in different ways in order to obtain straight or smooth angles between 
adjacent planes. 

The present algorithm is an iterative process based on the recognition algo- 
rithm. 

Let Vq be a voxel of the object such that his surfel with normal N belongs to 
the boundary of the object. We considere this voxel as the origin of the plane. 
By applying the algorithm of recognition, we verify that Vq is center of a tricube 
such as all the surfels of the voxels with normal N belong to the boundary of 
the object. If Vq is not center of a tricube we color its surfel in white. Otherwise, 
we verify that its neighbours are centers of tricubes of the same naive plane. If 
a neighbour belongs to that plane we mark the surfel as belonging to that plane 
and we can analyze in a same way its neighbours. Otherwise, this voxel will be 
treated as the origin of a new plane. 

We analyze in the same way all surfels of the object boundary until they are 
colored in white or marked. 

The obtained results are shown on a chanfrein cube and a cube cut by three 
naive planes (cf. Fig 7). 




(a) (b) 



Fig. 7. Chanfrein cube and poly tope: Voxels with the same gray level belong 
to a same naive plane; white voxels belong to the boundaries between adjacent 
planes 
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6 Conclusion 

A generic algorithm for coplanar voxels recognition has been presented. This 
algorithm analyses any configuration of voxels set either connected or not con- 
nected. It is fully discrete working in the dual space issued from Farey net repre- 
sentation of the normal equation of a digital naive plane. This algorithm has been 
used for polyhedrization of the boundary of voxelized objects. As perspective, a 
lot of work has to be achieved precisely in computation of the exact area evalua- 
tion of such a boundary using the Pick theorem well known in 2D [Sta86,CM91]. 
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Abstract. In this paper we define the 3d topological map and give an 
optimal algorithm which computes it from a segmented image. This data 
structure encodes totally all the information given by the segmentation. 
More, it allows to continue segmentation either algorithmically or inter- 
actively. We propose an original approach which uses several levels of 
maps. This allows us to propose a reasonable and implement able solu- 
tion where other approaches don’t allow suitable solutions. Moreover our 
solution has been implemented and the theoretical results translate very 
well in practical applications. 



1 Introduction 

An early stage of an image analysis process is to group related information in 
the image into regions: it is segmentation. This problem has been widely studied 
and efficent algorithms which perform segmentation are well known. A related 
problem concerns the data structure used to represent the segmentation. Indeed, 
after segmentation, more complex algorithms (such as pattern recognition) need 
to be run in order to give useful results. These algorithms need a structuration 
of the data which represents completely all the information given by the seg- 
mentation. Moreover this structuration need to be efficiently computable. For 
example we may have to deal with the topological characteristics of the volume 
found, as well as to work interactively on the objects modelled. 

Numerous data structures have been already proposed. For example, a simple 
one is to associate each pixel to the region it belongs to [15]. But this solution 
is not efficient if we have to cut or merge regions. There is also a solution using 
quadtrees or octrees [8,16] which are based on a recursive decomposition of the 
image. But this structure is difficult to modify and doesn’t allow to check easily 
adjacency relations between regions. Same drawbacks can be reproached to the 
pyramidal structures [2,12]. 

Other structures focus on information related to the regions, for example the 
region adjacency graph [14], which codes the adjacency relation between regions. 
But it is not topologically consistent and doesn’t relate the inclusion or multiple 
adjacency, so it doesn’t allow a precise analysis. 



G. Borgefors, I. Nystrom, and G. Sanniti di Baja (Eds.): DGGI 2000, LNGS 1953, pp. 311-324, 2000. 
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Some other solutions try to use several of the formers in order to gather the 
advantages offered by each of them. But this lead to complex solutions, often 
non efficiently computable. 

This is this lack of adequate solutions which has lead [10] to define the topo- 
logical graph of frontiers. The interest of such a structure is to be consistent 
with the topology of the objects it represents. Moreover, this structure can be 
efficiently computed thanks to a linear algorithm in the number of pixels which 
proceeds in one scan of the image [9] . One can remark that this solution is close 
to the one of [3,4,5] which defines the 2d topological map, the difference being 
in the extraction algorithm. 

But these solutions are only defined in dimension 2. The need to work in 
dimension 3 has lead [1] to propose the border map. This solution can easily be 
extended to dimension n. Moreover, they have proposed a very simple algorithm 
which extract this structure from a 3d segmented image. But the border map 
is not homogeneous. It mixes topological and geometrical information. Further- 
more, the optimal algorithm proposed is not practicable in dimension 3 since it 
requires the definition of more than 4000 differents cases, which is impossible to 
achieve. 

We then propose an evolution of the border map: the 3d topological map. 
This data structure is completely homogeneous and requires less memory than 
the border map. We also give an extraction algorithm which uses a limited 
number of cases and so allows an implementation. The topological and the geo- 
metrical information has been completely separated allowing to modify one type 
of characteristic on one object independently to the other. 

In order to present the topological map, we define several maps levels, the 
border map becoming one of this levels. We start Sect. 2 by a short introduction 
on combinatorial maps. Then Sect. 3 we come back on the basis of border map. 
Section 4 we show the notion of topological map and the principle of the ex- 
traction algorithm in dimension 2. Then Sect. 5 we extend the precedent results 
to the dimension 3. At last Sect. 6 we show and analyse some results and we 
conclude this paper Sect. 7. 

2 Combinatorial Maps 

Combinatorial maps are a mathematical model of representation of space sub- 
divisions in any dimension. They are consistent with the topology of the space 
as they also code the adjacency relation between all the subdivisions. For a 
complete definition see for example [6,7,11]. 

A combinatorial map of dimension n is called a n-map. The n-map allows to 
represent subdivisions of orientable n-manifolds. The notion of maps has been 
generalized in [13] for the representation of quasi-manifold cellular of dimen- 
sion n, orientable or not. 

In dimension n, a n-map is an (n + l)-tuple M = (T),/3i,/32, . . . ,/4^) such 
that D is the set of darts, j3i is a permutation on D and the other (3i are 
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involutions^. Each f3i links two cells of dimension i. The f3i are called sews, and 
a dart b is said i-sewn to a dart c if f3i{b) = c. For each dimension the sets of 
darts corresponding to the Tcells are a partition of the set of all darts. Maps 
code the topology of objects, but it is not sufficient in order to represent them 
completely. Information on how drawing them are necessary: it’s the notion of 
embedding. For a cells subdivision of space, this embedding associates to each i- 
cell a geometric object of dimension i. 

For example in dimension 3, the embedding of a surface consists in relating 
a point to each vertex, an open curve to each edge and an open surface to each 
face. An open face is associated to each cell in order to avoid duplication of 
information. Indeed, border of an Tcell is an (i — l)-cell to which an embedding 
is already associated. 

But according to our needs or constraints, we can choose not to embed each 
cell and compute the missing embedding. For example, in dimension 2, we can 
embed only the edge with closed curves. In this case, the embedding of the 
vertices can be retrieved by taken the extremities of one incident edge. It is 
then necessary to ensure the coherence between topological and geometrical 
information: if two edges are incident to the same vertex, there embeddings 
have to have the same extremity. 

3 Border Maps 

We give, here, a brief recall on the border map defined in [1]. Note that in this 
paper we use combinatorial map instead of generalized map, this, in order to 
save memory. 

Definition 1. The border map is a combinatorial map which codes the inter- 
pixel contours of a segmented image. Each edge of the map corresponds to a 
segment of the interpixel contours of the image. 

The border map codes the topology of the contours but also the geometry. 
So it contains all the information given by the segmentation about the objects 
found. 

A level 0 map is the complete map obtained by sewing ni x n 2 squares 
between them. Then level i map is the map obtained from i different sorts of 
merging. For dimension 2, we get the level 1 map by merging adjacent faces of 
the same region. Level 2 map is then obtained by merging adjacent edges on the 
same line incident to a degree 2 vertex^. This level 2 map is in fact the border 
map. 

We can see Fig. 1, the border map of an image and also the two intermediate 
levels of maps. We can see on this example, that after the merging of faces, the 
map can be disconnected. This problem is solved by adding an inclusion tree to 
the connected components of the map. This tree can easily be computed by a 
linear algorithm in the number of darts of the map [1]. 

^ An application / is an involution if and only if / o / is the identity application. 

^ A vertex is of degree 2 if and only if it’s incident to exactly two edges. 
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Fig. 1. The border map 
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For dimension 3, we proceed on a similar way to obtain the border map. But 
an additionnal level is necessary (see Algo. 1). 



Algorithm 1: The border map in dimension 3 

Input: An image I of ni x ri 2 x ns pixels 
Output: The border map of I. 

1 Build a 3-map C of m x ri 2 x ns cubes sewn in between them; 

2 Merge all couple of adjacent volumes having the same region; 

3 Merge all couple of coplanar faces incident to a degree 2 edge; 

4 Merge all couple of aligned edge incident to a degree 2 vertex; 



The extraction algorithm in dimension 3 is simply the extension of the naive 
extraction algorithm of the border map in dimension 2. We now have three sorts 
of merging: 

— Merging of adjacent volumes which consists in deleting the faces in between 
the volumes and to update the sews in order to have only one volume (see 
Fig. 2.b): we get the Level 1. 

— Merging couple of coplanar faces which is equivalent to two merging of faces 
in dimension 2 (see Fig. 2. a): we get the Level 2. 

— At last, the merging of edges on the same line which is equivalent to do, one 
by one, several merging of edges in dimension 2: we then get the Level 3 
which is also the border map. 




a. Face merging b. Volume merging 

Fig. 2. Face and volume merging in dimension 3 
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As for dimension 2, an inclusion tree of the regions is needed. But it is not 
sufficient since we can also have non-connected faces. Indeed when the merging 
of coplanar faces is done, the border of faces can be disconnected and so the 
map. In order to solve this new problem we can add an inclusion tree of faces 
which, for each face, gives the list of faces included. 

In order to embed a border map in dimension 2 or 3, it is sufficient to give the 
coordinates of each vertex of the map. Indeed only elements on the same plane 
or line are merged, so edges are only straight segments and faces are coplanar to 
these edges. So the embedding of the vertices allows to compute the embedding 
of an edge from the embeddings of the two incident vertices, and the embedding 
of a face from the embedding of all incident edges. This choice of embedding 
facilitates process and extraction of the border map. 

But this leads to an important problem: the same topological information can 
be coded differently according to the geometry of the image : two topologically 
equivalent objects could have two different border maps. 

The border map mixes topological and geometrical information. The notion 
of topological map has been introduced to solve this problem. We first present 
it in dimension 2 in order to help us to understand the idea of the algorithms 
and the relation in the differents precodes of our different levels. But the goal 
remains the definition of a 3d topological map and this is presented in Sect. 5. 

4 2d Topological Map: Last Level of an Hierarchy of 
Maps 

In the border map, some merging are done accordingly to the embedding of 
the associated cells. So, two topologically equivalent images, but with different 
embeddings will have two different border maps. 

So we have to do merging only on topological criteria. Practically for dimen- 
sion 2, in addition to the merging of edges on the same line and incident to 
degree 2 vertices, we also have to merge edges incident to degree 2 vertices but 
not on the same line. Edges embeddings are no longer line segment and embed 
only vertices as for the border map is no longer sufficient. 

Definition 2. The topological map is a combinatorial map which codes the 
intcrpixcl contours of a segmented image. Each edge of the map corresponds to 
an entire interpixel frontier of the image. 

If we forget for a moment the problem of embedding, computing the topo- 
logical map is not really a problem, there is just another sort of merging to add 
after the computation of the border map: the merging of each couple of edges 
incidents to a degree 2 vertex. This Level 3 map is the topological map. If we 
take the example of Fig. 1 showing maps of Level 0 to 2, the topological map is 
then the one of Fig. 3. On this figure, the darts are drawn without taking into 
account their embedding. In fact the embedding of each edge should be coded by 
an open curve. An interesting way of doing this is to use a 1-map representing the 
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Fig. 3. Topological map without embedding 



curve corresponding to the embedding of the edge. We then get an hierarchical 
structure where z-maps are used to code the embedding of z-cells. 

We can see on Fig. 3 that there is exactly one edge for one frontier between 
two regions, so a frontier is always coded by exactly two darts. This imply that 
the topological map requires less memory than the border map. This can be 
very important to deal with 3d images. In the topological map only topological 
information darts have been kept. So two topologically equivalent images will 
give the same topological map, except for the embedding. 

We can easily deduce from the definition of the topological map a first ex- 
traction algorithm: building the different levels, starting from level 0 and then 
each level is built from the former by doing the supplementary mergings. In- 
deed deducing one level from the former level is very simple. Moreover, the same 
method can easily be applied for the definition of the topological map and its 
associated extraction algorithm for any dimension. 

But this algotihm is not optimal since it needs several image scans and it 
creates many darts which are in the following deleted. In order to improve this 
algorithm, [1] gives an algorithm based on precodes which computes directly the 
border map. We have extended this algorithm in order to compute the map of 
any level directly and in one scan. 



4.1 Extraction of the Level 1 Map in One Scan 

This algorithm scans the image with a 2 x 2 window and does the operations 
given by the configuration of the frontiers (called a precode) in the 2x2 window. 
So the map is built in one scan and no more than the exact number of darts are 
created. 

The idea of the algorithm is that during a scan from top to bottom and 
from the left to the right, the face corresponding to the current pixel can only 
be merged with the one above or the one on the left. Then, We only have four 
possible cases which corresponds to all the combinations of merging this face 
with the two others. Figure 4 shows this four face merging precodes. 

The algorithm uses two invariants which say that before processing one pixel: 

— The map corresponding to the image already scanned is built. 

— The dart corresponding to the right border of the last processed pixel is 
known. 
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Fig. 4. The four precodes of face fusion 



These two invariants insure us that we can proceed by modifying the map. 
Indeed we have to know how to link the map associated to the current pixel 
to the global map. So before scanning the image we have to build a map done 
with ni horizontal darts, corresponding to the upper border of the image, and 1 
vertical dart to the left, all these darts being sewn by f3\. So the invariant is also 
verified for the first pixel and for the first line of the image. 

During the scanning, we consider that the image is on a cylinder. Thus, 
when the algorithm treats the last pixel of a line, it can create the dart for the 
left border of the next line. Special treatments for borders of the image are so 
avoided. We only have to define the treatment associated for each of the four 
precode and the Level 1 map extraction algorithm will be defined. In fact this is 
easily done as we can see on the example of Fig. 5 and on Algo. 2 associated. 




Fig. 5. Precode / 2 : map before processing this precode and map to be obtained 



Algorithm 2: Processing of precode /2 

^ a new dart; t 2 ^ a new dart; 

(3i - sew{t2, (3i - sew{ti,t2)] 

ti); P2 — sew{l,u); 

(x,y) ^ the embedding of vertex incident to u; 
Embedding of vertex incident to t 2 ^ (x + 1, y + 1); 

return tl 



We can see on this example that not all the local configuration is known 
(the dart sewn to the dart u is not known), but it is not a problem since the 
operations to be done are independent of the fourth pixel. This precode returns 
the dart which will be the dart to the left for the next precode. At last we can 
see that the treatment is very simple, this is the same thing for the other three 
precodes. 

Extraction algorithm of Level 1 map in one scan is very simple and is de- 
scribed in Algo. 3. This algorithm is in fact similar for all maps level, it is just 
necessary to define the precodes corresponding to the level concerned. So we will 
now focus on the precodes for the other levels of the map. 
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Algorithm 3: Extraction algorithm in one scan 

Entree: An image I of ni x ri 2 pixels 
Sortie: Level i map of I. 

1 Build the upper border of the map; 

2 for j = 1 ri 2 + 1 do 

for i — 1 to ni -\-l do 
|_ Process the precode (i, j); 



4.2 Extraction of Level 2 Map in One Scan 

For level 2 map, we have to add the merging of edges incident to a degree 2 
vertex and on the same line. We can easily check that only two precodes are 
concerned by this merging: these are the precodes of Fig. 6. a. 



bi 






a. The two precodes for level 2 map. 





b. The four precodes for topological map. 



Fig. 6. Precodes for level 2 and 3 maps 



Finally we have six precodes for the Level 2 map: the four of the level 1 and 
these two of Fig. 6. a. The extraction algorithm is similar, we just have to look 
at the six precodes. 



4.3 Extraction of the Topological Map in One Scan 

For the topological map, in addition to the precedent merging, we have to merge 
the edges incident to a degree 2 vertex. These edges are necessary not on the 
same line, else they have been processed by one of the two precedent precodes 
for level 2 map. It is easy to see that there are only four precodes concerned by 
this sort of merging. These precodes are shown on Fig. 6.6. 

In addition, this level requires an embedding for the edges, which can be 
represented by a 1-map. The general algorithm is also similar to Algo. 3 but ten 
precodes have to be checked. Nevertheless, this algorithm is still simple and can 
easily be implemented. It is also optimal and built directly the topological map. 

Here we see the advantage of having defined several map levels: we have a 
relation between the number of precodes and the level of the map we want to 
extract in one scan. It allows to choose to extract directly Level i map accordingly 
to the need or development constraints, without paying more in processing time. 
This approach is particularly interesting in the 3d case where the number of 
precodes is huge. 
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5 3d Topological Map 

As for dimension 2, we have to do merging according to topological criteria and 
without taking into account the embedding associated to the cells. We have seen, 
Sect. 3 that the border map of dimension 3 is the Level 3 map where adjacent 
volumes of the same region, coplanar faces incident to degree 2 edge and same 
line edges incident to degree 2 vertex have been merged. The level 4 map is then 
the map obtained from the Level 3 map where incident to degree 2 edge faces 
are merged. 

As in dimension 2 where an edge embedding was added, this merging of non 
coplanar faces will require to add a face embedding. We can also use here a map 
for coding the embedding. Of course this map will be a 2d topological map where 
their edge embedding will be Id map. 

The topological map of dimension 3 (Level 5 map) is defined algorithmitically 
by merging in the Level 4 map each couple of edges incident to a degree 2 vertex: 
all the redundant topological information has been merged. 

The simple algorithm which builds the map of Level i consists now to build 
a complete map (Level 0 map) of ni x U 2 x ns voxels sewn, then to build level 
after level the different maps. The Level i -\-l map is obtained from the Level i 
map by doing the adequate additional merging. 

[1] have tried to apply the same method as for dimension 2 in order to 
bring a more efficient algorithm. But they have calculated, thanks to the Stirling 
numbers, that it could be necessary to define 4140 different precodes in order 
to be able to compute optimally and in one scan the border map! In fact this 
number denotes the total number of possible precodes in dimension 3. 

But we are interested only in manifolds, so in a first time, we have tried to 
calculate the number of manifold precodes. But we didn’t manage to achieve a 
formula giving us this result. We finally computed it by the help of a program 
and found 958 manifold precodes. This number is more encouraging but the 
implementation of such a number of precodes is not yet easy. In fact the de- 
composition of the problem in different levels of map will bring us a reasonable 
solution. 

5.1 8 Precodes for Level 1 Map 

The scanning of the image starts from up to down, from behind to front of and 
from left to right. So a voxel can only be merged with the upper voxel, the 
voxel behind and the voxel to the left. We have then 8 possibility of volume 
merging and we get the 8 precodes of Fig. 7. More Generally, in dimension n, we 
get 2'^ precodes for the Level 1 map. Indeed, the current element has exactly n 
neighbours already scan. 

The extraction algorithm requires a similar invariant as for the dimension 2 
version which ensure us that there exists in the map a face to the left, a face be- 
hind and a face with the top of the current voxel. The definition of the operations 
for each precode is not a lot more difficult that in dimension 2. The difficulties 
provide from the number of darts in one precodes, and from the fact that this 
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Fig. 7. 8 Precodes of Level 1 Map 



is not easy to represent and visualize the object in dimension 3. The general 
algorithm is similar as the one of dimension 2, it requires only an additional 
loop for the additional dimension. 

5.2 18 Precodes for Level 2 Map 

At this level we have to do the merging of coplanar faces incident to degree 2 
edge. Intuitively, in order to merge coplanar faces, the two voxels “containing” 
this faces are to be of the same region, and that the two front of voxels are of 
another region but the same for the two. We can see Fig. S.a some examples of 
such precodes. 




a. Two precodes for level 2 map. 




Two precodes for level 3 map. 



Fig. 8. Examples of precodes for Level 2 and 3 Map 



The low number of cases allows us to find exhaustively the 18 precodes for 
this sort of merging. We, then, immediately get the optimal extraction algorithm 
in one scan of Level 2 map: it’s enough to check for each voxel in which of the 26 
(the 18 for the coplanar face merging plus the 8 volume merging precodes) cases 
(precodes) it is and then to apply to corresponding operations on the map. 



5.3 27 Precodes for the Border Map 

For this level, we have to look at the merging of edges incident to a degree 2 
vertex and on the same line. Intuitively, this merging can only be done if the 
vertex in the center of the precode is of degree 2. It means that all the voxels are 
two by two of the same region. Fig. S.b shows some examples of such precodes. 
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In this case also, the low combinatorial allows us to find easily all the 27 
precodes involved. So the total number of precodes to extract directly and in 
one scan the border map is of 53. We obtain a number which is far away from the 
4140 first claimed in [1], and far away from the 958 cases of manifold. This last 
result is obtained thanks to our new approach which considers different levels of 
map. 

But the topological map, which is our final objective, requires the implemen- 
tation of Level 4 and 5. 

5.4 Level 4 Map and Topological Map 

Until the Level 3, only on the same line or on the same plane elements are 
merged. So the number of precodes remains limited and we have been able to 
determine all the precodes. But from level 4, things become more complicated 
and Ending all the precodes is a lot more difficult and a long work. So we have 
just tried to determine the number of precodes for each level. This was done 
by the help of a program which, for each precode among the 958, builds the 
corresponding map, then does the merging required by the Level i considered. 
It is then sufficient to group the precodes with the same merging and count the 
number of equivalence classes. 

The numbers of 8, 18 and 27 precodes for the first three levels of map have 
been confirmed. Then we get 98 precodes for the Level 4 map and 216 precodes 
more for the topological map. We recall that these numbers are the numbers 
of precodes to add to the precedent level. So the total number for an algorithm 
which would compute the topological map directly and in one scan would require 
the definition of 367 precodes! Our approach allows us to divide by half the total 
number of precodes required. But even if we are still far away from the 958 or 
better from the 4140 initially claimed, there are still too many precodes for a 
human. 

But an intermediate solution exists. It consists in computing directly the 
border map by the way of the 53 precodes, and then to proceed as for the 
general algorithm and compute Level 4 and Level 5 (the topological map) by 
doing the additional merging on the border map. This solution is the one we 
have implemented and is an interesting compromise between the running time, 
the development time and the memory space required. 

6 Experiments and Analysis 

In dimension 2, we have implemented the extraction of maps for each level 
with the precode based algorithm. Results about memory needed can be seen 
on Table 1. The running time doesn’t change a lot for the different levels and is 
about 0.20 seconds for a 512 x 512 on a 600 MHz PHI with 256 Mb of memory, 
included the computation of the inclusion tree. 

In dimension 3, we can see on Table 2, for each level, the differences between 
the number of darts and the memory space. The method implemented is the 
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Table 1. Comparison of the different levels in dimension 2 





Level 1 


Level 2 


Level 3 


Lena, 
256 region 


58 724 2d darts 
1 650 004 Bytes 


31 648 2d darts 
891 876 Bytes 


1 160 2d darts 
15 244 Id darts 
350 948 Bytes 


Lena, 

14218 regions 


220 900 darts 2d 
6 478 564 Bytes 


139 848 darts 2d 
4 209 108 Bytes 


67112 darts 2d 
36 368 darts Id 
3 095 524Bytes 



Table 2. Comparison of the different levels in dimension 3 





Level 1 


Level 2 


Level 3 


Level 5 


Unit 


Heart, 


245 296 


33 936 


23 900 


1408 


3d darts 


64 X 64 X 64 


- 


- 


- 


16 435 


2d darts 


12 regions 


7998 920 


1235 400 


854 032 


225 

598 068 


Id darts 
Bytes 


Skeleton, 


10 109 680 


1354 412 


922 072 


44 300 


3d darts 


512 X 512 X 177 


- 


- 


- 


632 396 


2d darts 


4 regions 


329 647 688 


49 479112 


33 050 192 


14 828 
22 905 824 


Id darts 
Bytes 


Legs, 


17869 952 


5152 672 


3 282 702 


710 518 


3d darts 


512 X 512 X 55 


- 


- 


- 


1667613 


2d darts 


13926 regions 


584 564 284 


177 611324 


117 772 284 


184 554 
82 155 560 


Id darts 
Bytes 



one described at the end of Sect. 5. The computation of the Level 5 map takes 
about 3 seconds for the heart (Fig. 9. a), 470 seconds for the skeleton (Fig. 9.b) 
and about 200 seconds for the legs. The running time of the other levels is a little 
more important, due to much more dart creations and so much more memory 
allocation calls. 

We can see on Table 2 that the gain in memory is very significant. These 
first results confirm that the 3d topological map is well suited for applications 
on 3d images. Indeed in dimension 3 the big amount of memory required could 
limit the size of the images processed. For example, for the legs where all regions 
are considered, only 55 slices have been processed and the level 1 map requires 
too much memory space to be used in a basic computer. In the contrary the 
topological map only needs about SOMb which is reasonable and can be used on 
a classical computer. 

7 Conclusion 

In this paper we have presented a data structure and an algorithm which recon- 
struct a 3d segmented image. This structure, the 3d topological map represents 
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a. Heart b. Part of a skeleton 

Fig. 9. Topological maps extracted from a segmented image 



totally the objects found by the segmentation. It allows to continue the segmen- 
tation either algorithmically or interactively, since based on combinatorial maps 
used in geometric modeling. This structure can be used to compute geometric 
properties as well as topological properties of the objects represented. 

The approach proposed which uses different levels of simplihcation has al- 
lowed to define on a simple manner the structure and to propose a simple, linear 
(but not optimal) extraction algorithm. In order to achieve an optimal solution, 
we have introduced a new algorithm based on precodes. This algorithm builds 
the different levels of map in one scan of the image and is then very efficient. Nev- 
ertheless, it remains very simple since it proceeds by checking a configuration, 
called a precode, and realizes the operations associated to the met configuration. 

In order to implement this algorithm, it is then sufficient to dehne these 
operations. A first study gave 4140 possibles cases, but we have proven here 
that there are only 958 precodes suitable for the non-manifold case which we 
are interested in, and moreover that our approach gives the possibility to study 
only one half, i.e. 367 precodes, for the last level of map. But the intermediate 
levels require only 8, 18, 27 and 53 precodes and then can easy be implemented. 

This level approach allows to choose an hybrid solution in order to avoid the 
367 precodes: compute directly an intermediate level of map and then build the 
last levels from this one. This is the solution we have implemented and tested. 
The border map have been computed directly with the 53 precodes and then, 
level 4 and the topological map has been built from the border map by doing 
appropriate merging of elements of the map. 

At last our experiments prove that the topological map requires less memory 
than the other map levels, which is an important point for 3d images. More- 
over in order to apply segmentation algorithm on a structure it is important 
that it doesn’t code redundant information. In fact, adjacency relation, that is 
topological information, is the key point of such algorithms. Our structure is 
then well-adapted, not only to reconstruction and modeling, but also to image 
analysis . 




324 



Yves Bertrand et al. 



Acknowledgements 

We wish to thank Lionel Gral for useful comments and careful reading of this 

paper. 

References 

1. Y. Bertrand, C. Fiorio, and Y. Pennaneach. Border map : a topological repre- 

sentation for nd image analysis. In G. Bertrand, M. Crouprie, and L. Perroton, 
editors. Discrete Geometry for Computer Imagery^ number 1568 in Lecture Notes 
in Computer Science, pages 242-257, Marne-la-vallee, France, March 1999. 312, 

313, 316, 319, 321 

2. M. Bister, J. Cornelius, and A. Rosenfeld. A critical view of pyramid segmentation 
algorithms. Pattern recognition Letters^ 9(11):605-617, 1990. 311 

3. J. P. Braquelaire and L. Brun. Image segmentation with topological maps and 
inter-pixel representation. Journal of Visual Communication and Image Represen- 
tation^ l(9):62-79, 1998. 312 

4. J. P. Braquelaire and J. P. Domenger. Representation of segmented images with 
discrete geometric maps. Image and Vision Computing^ pages 715-735, 1999. 312 

5. L. Brun. Segmentation damages couleur a base Topologique. These de doctoral, 
Universite Bordeaux I, decembre 1996. 312 

6. R. Cori. Un code pourles graphes planaires et ses applications. PhD thesis, Uni- 
versite Paris VII, 1973. 312 

7. R. Cori. Un code pour les graphes planaires et ses applications. In Asterisque^ 
volume 27. Soc. Math, de France, Paris, France, 1975. 312 

8. R. C. Dyer, A. Rosenfeld, and H. Samet. Region representation : boundary codes 
for quadtrees. In ACM 23, pages 171-179, 1980. 311 

9. C. Fiorio. Approche interpixel en analyse d’images : une topologie et des algo- 
rithmes de segmentation. PhD thesis, Universite Montpellier II, 1995. 312 

10. C. Fiorio. A topologically consistent representation for image analysis: the fron- 
tiers topological graph. In S. Miguet, A. Montanvert, and S. Ubeda, editors, 6th 
International Workshop, DGCI’96, number 1176 in Lecture Notes in Computer 
Sciences, pages 151-162, Lyon, France, November 1996. 312 

11. A. Jacques. Constellations et graphes topologiques. In Combinatorial Theory and 
Applications, pages 657-673, Budapest, 1970. 312 

12. W. G. Kropatsch. Building irregular pyramids by dual graph contraction. In lEE 
Proceedings : Vision, Image and Signal Processing, volume 142(6), pages 366-374, 
1995. 311 

13. P. Lienhardt. Subdivision of n-dimensional spaces and n-dimensional generalized 
maps. In 5^^ Annual ACM Symposium on Computational Geometry, pages 228- 
236, Saarbriicken, Germany, 1989. 312 

14. A. Rosenfeld. Adjacency in digital pictures. Inform, and Control, 26:24-33, 1974. 
311 

15. A. Rosenfeld and A. C. Kak. Digital Picture Processing, volume 2. Academic 
Press, New York, 1982. 311 

16. H. Samet. Region representation : quadtrees from boundary codes. In ACM 23, 
pages 163-170, 1980. 311 



Some Weighted Distance Transforms 
in Four Dimensions 



Gunilla Borgefors 



Centre for Image Analysis, Swedish University of Agricultural Sciences 
Lagerhyddvagen 17, SE-752 37 Uppsala, Sweden 
gunillaQcb .uu.se 



Abstract. In a digital distance transform, each picture element in the 
shape (background) has a value measuring the distance to the back- 
ground (shape). In a weighted distance transform, the distance between 
two points is defined by path consisting of a number of steps between 
neighbouring picture elements, where each type of possible step is given a 
length- value, or a weight. In 4D, using 3 x 3 x 3 x 3 neighbourhoods, there 
are four different weights. In this paper, optimal real and integer weights 
are computed for one type of 4D weighted distance transforms. The most 
useful integer transform is probably (3,4, 5,6), but there are a number 
of other ones listed. Two integer distance transform are illustrated by 
their associated balls. 



1 Introduction 

Results regarding the 4D digital space, Z^, are being found more and more in 
literature, both regarding theory and emerging applications. Examples where 
4D is used are: when processing 3D grey-level images, just as some 2D problems 
are solved using temporary 3D images; for volume data sequences, as ultrasound 
volume images of a beating heart; or for the discretisation of the parameter space 
of a robot or robot arm. Some examples are [6, 7, 8, 9]. 

In a Distance Transform (denoted DT), each element in the shape (back- 
ground) has a value measuring the distance to the background (shape). DTs have 
proven to be an excellent tool for many different image operations. Therefore, 
distance transforms (DT) in 4D are moving from being a theoretical curiosity, [1], 
to becoming a useful tool. 

The basic idea, utilised for most DTs, is to approximate the global Euclidean 
distance by propagation of local distances, i.e., distances between neighbour- 
ing pixels. This idea was probably first presented by Rosenfeld and Pfaltz in 
1966, [10]. This approach is motivated by ease of computation. In sequential 
computation only a small area of the image is available at the same time. In 
massively parallel computation (if such an approach still exists) each pixel has 
access only to its immediate neighbours. 

Weighted or chamfer distance transforms, denoted WDT. The local steps 
between neighbouring pixels are given different weights. In 2D, the most common 
WDTs are (2,3) and (3,4), were the first number is the local distance between 
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edge-neighbours and the second number is the local distance between point- 
neighbours, [2]. Weighted DTs can be computed in arbitrary dimensions by two 
raster scans through the image, where, at each point, the image values in a small 
neighbourhood of the point are used to compute the new point value [1,2]. A 
hrst, not very good, effort of discovering WDTs in higher dimensions is found 
in [1], 

Important theoretical results on general DTs in higher dimensions have been 
published [8] a few years ago. In this paper, necessary conditions for an nD DT 
to be a metric are presented. In [3], WDTs in 3D, fulfilling these criteria were 
exhaustively investigated for 3 x 3 x 3 neighbourhoods. There proved to be two 
types of such DTs, the “obvious” one and one less intuitive. In 4D, the situation 
is even more complex, and there are at least eight different cases of WDTs. In 
this paper, the most “natural” case will be investigated, and optimal real and 
integer weights for this case will be presented. 

In Section 2, the geometry and general equations are developed. In Section 3, 
optimal weights are computed, where optimality is defined as minimising the 
maximum difference from the Euclidean distance inanMxMxMxM image. 
In Section 4, the optimal real and integer WDTs are listed and two integer DTs 
are illustrated by their associated balls. 

2 Geometry and Equations 

Denote a digital shape on a hyper cubic grid T, and the complement of the shape 
T, where the sets F and F are not necessarily connected. A distance transforma- 
tion converts the binary image to a distance image, or Distance Transform (DT). 
In the DT each hyxel (hypervolume picture element) has a value measuring the 
distance to the closest hyxel in F. 

A good underlying concept for all digital distances is the one proposed by 
Yamashita and Ibaraki, [11]: 

Definition 1 The distance between two points x and y is the length of the 
shortest path connecting x to y in an appropriate graph. 

They proved that any distance is definable in the above manner, by choosing 
an appropriate neighbourhood relation and an appropriate definition of path 
length. 

In 4D hyper cubic space, each hyxel has four types of neighbours: 8 volume 
neighbours, 24 face neighbours, 32 edge neighbours, and 16 point neighbours. A 
path between two hyxels in the 4D image can thus include steps in 80 directions, 
if only steps between immediate neighbours are allowed. 

The DT(z, j, /c, /) of a hyxel in F is the minimum length of a path connect- 
ing (i,j,k^l) to any hyxel in T, where steps between volume neighbours have 
length a, steps between face neighbours have length 6, steps between edge neigh- 
bours have length c, and steps between point-neighbours have length d, and no 
other steps are allowed. Due to symmetry, it is enough to consider distances 
from the origin to a hyxel (x, z, ic), where 0<w<z<y<x<M and M 
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is the maximal dimension of the image when computing optimal a, b, c, and d. 
The distance to be minimised then becomes D{x, y,z,w). A3x3x3x3 WDT 
will be denoted (a, 6, c, d). 

As the length of any minimal path is defined only by the numbers of steps of 
different types in it, the order of the steps is arbitrary. Therefore, we can always 
assume a minimal path where the steps are arranged in a number of straight 
line segments, equal to the number of different directions of steps used. 

Not all combinations of local distances a, 6, c, d result in useful distance trans- 
forms. The DT should have the following property. 

Definition 2 Consider two pieture elements that ean he eonneeted by a straight 
line, i.e., by using only one type and direetion of loeal step. If that line defines 
the distanee between the pixels, i.e., is a minimal path, then the resulting DT is 
semi-regular. If there are no other minimal paths, then the DT is regular. 

From [8] we have the following result. 

Theorem 1 A distanee transform in 7A that is a metrie is semi-regular. A 
semi-regular distanee transform in 7? is a metrie. 

Thus, all suggested DTs should be semi-regular as this is a necessary but, in 
higher dimensions, not sufficient condition for being metrics. 

As there are four types of steps, there are four types of straight paths pos- 
sible in the hyper cubic grid. To find the conditions for 4D regularity we must 
investigate all the ways these four straight paths can be approximated by paths 
using other steps and find the conditions for the straight path being shortest. 
The result is that a 4D WDT is semi-regular if the following inequalities hold 
(see [3] for a complete description of the method of computation): 

3 4 

a < b, b < 2a, b < c, c < —b, c < d, d < —c. (1) 

Zi O 

These inequalities define a hyperpolyhedron in a, 6, c, d-parameter space. A cut 
through this polyhedron at = 2 and with a as a scale factor is shown in Fig. 1. 

The conditions in (1) may seem restrictive, but they are not sufficient to 
determine unique expressions for the WDTs. If we compute the distances from 
the origin, choosing the shortest paths and assuming that the local distances have 
the properties in (1), we discover (at least) eight different, equally valid cases. For 
example, the hyxel (2, 2, 1, 1) can be reached either as (1, 1, 1, 1) + (1, 1, 0, 0) = 
d -h 6 or as (1, 1, 1, 0) -1- (1, 1, 0, 1) = 2c. The inequalities in (1) do not determine 
which is the shorter path. In 3D there is the same phenomenon, but there are 
only two cases [3]. 

In Fig. 1, the eight cases discovered are marked by thin lines. In each of 
the Cases, expressions could be found for the distance transforms, and the local 
distances could be optimised. However, the area marked “Case I” is the most 
interesting and easiest to handle, as there is only one expression valid for all 
hyxels, and that expression, moreover, is the one we would expect, as it is an 
extensions of the equation in 2D and in Case I 3D. In the triangular hypercone 

b < 2a, c<d, a -\-c< 2b, b-{-d< 2c. (2) 
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Fig. 1. The hyperpolyhedron in a, 6, c, d-space that results in semi-regular 4D 
weighted distances (thick lines). The thin lines separates different cases. The 
grey area is Case I, which will be covered in this paper 

the distance between the origin and (x, y, z, w) is 

Case 1\ D = wd + (^ — w)c -\- {y — z)b + (x — y)a^ for ^<w<z<y<x (3) 

This equation, without the limitation of parameter space (2), is found in [1]. 

The 4D DT analogous to the chessboard DT in 2D is (1,1, 1,1), or 
where the distance to all 80 neighbours is set to 1. The 4D DT analogous to the 
city block DT in 2D is (1,2, 3, 4), or D^ (not to be confused with chessboard 
DT in 2D), where the distance to the eight volume neighbours is set to 1. Both 
D^ and D^^ are semi-regular (but not regular) according to the inequalities 
in (1). The equations for the distances can be expressed as in (3), still with 
^<w<z<y<x\ 



3 Optimality Computations 

In this Section, the optimal local distances for the Case I 4D WDT will be 
computed. Optimality is defined as minimising the maximal difference between 




( 4 ) 

( 5 ) 
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the WDT and the Euclidean distance in an image of size M x M x M x M. 
The choice of optimality criterion is somewhat arbitrary. However, this one has 
the advantage that is does not depend on any non-digital structure, such as 
an imbedded Euclidean sphere, which would be necessary to, e.g., minimise the 
average error. 

The maximum of a particular type of function will often have to be computed. 
The following Lemma is used. 

Lemma 2 The function f{^) = — Ay/y+T^, where \a\ < Vk\X\ and 

I 7 I > 1 has the maximum value 



Proof: The extremal value is found by setting the derivative of f'{f) to zero, 
solving for f and simplifying the resulting expressions. □ 

The difference between the computed distance, see (3), and the Euclidean 
distance is 



where 0 < w < z < y < x < M. This difference is to be minimised in an 
M X M X M X M image. Put the origin in a corner of the image. We can then 
assume that the maximum difference, denoted maxdiff, occurs for x = M. As 
0 < w < z, the maximum of E{w) occurs for ic = 0, d/dwE{y, z^w) = 0, or 
w = z. The difference in these three cases are found by simple insertion or by 
using Lemma 2 with a = {d — c)^f3= {c-b)z-\- {b — a)y Ma^ X = A; = 1 , 

and 7 = We get: 



Ei{y, z) = {c — b)z + (6 — a)y P aM — M‘^ Py‘^ -\- z‘^ for (M, z, 0), 

z) = (c — b)z + (6 — a)y + aM — ^yl — {d — y‘^ -\- z‘^ 



Es{y, z) = {d — b)z + (6 — a)y + aM — P y‘^ 2z‘^ for (M, z, z). 



Eor each of these three expressions the maximum can occur for z = 0, 
dldzE{y^ z) = 0, or z = ^, as 0 < z < ^. We get the following nine differ- 
ence expressions, using insertion and Lemma 2 . 




E{x, y, z, w) = {d — c)w + (c — b)z + (6 — a)y P xa — , ( 6 ) 



for 
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Eii{y) = (p — o)y + ciM — ^/M‘^ + y‘^ for (M, y^ 0, 0), 

Ei 2 i,y) = (b- a)y + aM - +y‘^ for (M, y, Zmax, 0), 

Eisiy) = (c - a)y + aM - + 2y^ ioi (M,y,y,0), 

E 2 l{y) = 0, as 0 = Z < W = Wniax, 

E 22 {y) = (b- a)y + aM - 

for (iVf, y, -S^jnaxi ^max)- 

E 2 a{y) = (c - a)y + aM - a/ 1 - (d - c)'^^/M‘^ + 2y^ for (M, y, y, Wmax), 

-B3i(y) = -Bii(y), 

Ez 2 {y) = {b- a)y + aM - ^1- \{d - ^ M'^ + y'^ for (M, y, Zmax, Zmax), 

Ezaiy) = {d- a)y + aM - a/aPT^^ for (M, y, y, y). 

( 7 ) 

For each of these seven expressions the maximum can occur for ^ = 0, d/dyE{y) 
= 0, or^ = M, as0<^<x = M. We get the following 21 expressions, again 
using insertion and Lemma 2. 



Em 

E\12 

Eu3 

E\21 

E\22 

E\23 

E\31 

E\32 

E\33 

E 22 I 

E 222 

E 223 

E 23 I 

E232 

E233 

E32I 

E322 

E323 



{a-l)M 
(a — 

{b - v^)M 

0 , as 0 = ^ < Z = Zmax, 

(a — y^l — (c — by — (b — aY)M 
(b - ^/2^1 - {c-bY)M 



for (M, 0,0,0), 

for (M,^max,0,0), 

for (M, M, 0, 0), 

for (-A^, ^max : -^max: 0) : 
for (M,M,Zmax,0), 



Emj 

{a^l - \{c - aY)M for (M, ^max, ?/max, 0), 

{c-V3)M for (M, M, M, 0) , 

0, as 0 = ^ < Z = -^max 

(a - v^l - (d - c)2 - (c - 6)2 - (6 - a)2)M 

for (AL, ^niax5 '^max^ "^max)5 

{b - a/ 2^1 - (d - c)2 - (c - bY)M for (AL, AL, Zmax, '^max), 

0, as0 = ^ = Z<ie = ICmax 



(a — — (d — cY — \{c — aY)M 

(c - v^v^l-(d-c)2)AL 
0, as 0 = ^ < Z = Zmax, 
a — — \{d — bY — {b — aY)M 

{b - V2^Ji-^{d-by)M 



for (AL, ^rnax 9 ?/max9 ^ max) 5 
for (M,M,M,Wmax), 



for {M,ymax, -^max^ '^max) : 

for (AL, AL, Zmax5 '^max) : 
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^331 = ^ 111 , 

-£'332 — i^CL 2^1 {d for (-A£, ^max5 ?/max: ?/max): 

£^333 = {d - 2) M for (M, M, M, M). 

Thus 15 difference expressions Eijk remain. The maximum of these 15 expres- 
sions should now be minimised by varying and d. Numerical experimen- 

tation show that mdiX (Eijk) is minimal for E 222 = —Em = —Em = —Em = 
—£^333. Solving these equations yields 



^opt — 1 ^ ^ 


s 0.9048, 


(.t 

1 

II 

§• 


a 1.3191, 


^opt — IZ I- 


a 1.6369, 


dopt = 2 — 7Z ^ 


a 1.9048, 


with maxdiff = 7ZM ?= 


a 0.0951M, 



(8) 



where IZ = 2 + 2a/3 + V2 — 7). 

The optimal solutions for a = 1 are needed when computing integer DTs, as 
then a becomes a scale factor. In this case, we solve £^222 ~ “£"ii3 ~ “£"133 ~ 
— T^333, where the star denotes that a = 1 has been substituted in the expressions. 
The solutions are 

<pt = h 

Kpt = ^-^ ^ 1.2796, 

= ^ 1.5975, (9) 

dl^, = 2-S ^ 1.8654, 

with maxdiff* = SM ^ 0.1346M, 

where S = — \/ VQ + 2a/3 + V2 — 7. 

In both situations, free a and a = 1, it is easy to check that the optimal 
solutions fulfil the inequalities (2), and thus are in the allowed hypercone in 
parameter space. Also, in both situations the maximum difference from the Eu- 
clidean distance is a fraction of the size of the image, as can be expected. 

Using real valued local distances in digital images is generally not desirable. 
Integer local distances are preferable. Candidate integer approximations of the 
optimal values, denoted A, 5, C, and £), are found by multiplying the optimal 
local distances by an integer scale factor and rounding to the nearest integer. 
Then the maximal differences are computed (all expressions are available from 
the computations of the optimal local distances), to check the approximations. 
The smallest local distance, a, will act as a scale factor, therefore the result- 
ing WDT will become (1, 5/A, C/A, £)/A). It is of course important to check 
that (2) are fulfilled, otherwise the difference expressions are invalid. The best 
approximation result possible is maxdiff* and the optimal local distances to be 
multiplied by the scale factor are 6*^^, c*^^, and Good integer 3 x 3 x 3 x 3 
WDTs are listed in the next Section. 
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Table 1. Integer 3 x 3 x 3 x 3 distance transformations 



Case 


a 




C 


d 


maxdiff 


D® 


1 


- 


- 


- 


2.00000 


D80 


1 


1 


1 


1 


1.00000 


real 


Uopt 


hopt 


Copt 


dopt 


0.09515 


real 


1 


hopt 


'-'opt 


hopt 


0.13456 


integer 


2 


3 


4 


4 


0.29289 


integer 


3 


4 


5 


6 


0.18350 


integer 


6 


8 


10 


11 


0.16667 


integer 


6 


9 


10 


11 


0.16667 


integer 


7 


10 


12 


13 


0.15485 


integer 


8 


11 


13 


15 


0.14304 


integer 


15 


20 


24 


28 


0.13590 



4 Results 

In this section the results of the optimality computations are summarised and 
illustrated. Table 1 lists a number of distance transforms. First, the simple D® 
and D®o are listed, with their associated maxdiff. These are easily computed 
from the expressions (4) and (5). Next in Table 1 comes the optimal values for 
Case I, both for free a and a = 1 . 

After the real valued WDTs, the best integer approximations, using scale 
factors (= A) up to 20, are listed in Table 1. For practical purposes, (3, 4, 5, 6 ) 
is probably the best choice. The maxdiff is reasonably good, with a small scale 
factor. Note that it is hard to improve on (15,20,24,28). Note also that all 
integer DTs are on the border of the allowed hypercone defined by ( 2 ), except 
(7, 10, 12, 13), which should thus exhibit the most “typical” traits of Case I DTs 
in 4D. 

It must be remembered that even if Case I is the “natural” one of the Cases 
for 4D DTs, there is no guarantee that it is the best Case. In 3D, the analogous 
Case I 3 gave the best maxdiffs hut the other case. Case II 3 , gave the best maxdiffg 
(with a = 1 ), so better integer approximations could be found for Case II 3 than 
for Case I 3 , see [3]. The same may well be true in 4D. 

A good way to characterise a DT is the shape of its associated ball, defined 
as all pixels/ voxels /hyxels with a distance less than or equal to the radius from 
a single central element. In 2D, the city block and chessboard distance balls are 
a diamond and a square, respectively. In 4D, the and balls are a hyper- 
octahedron and a hypercube, respectively (the tetrahedron, the octahedron, and 
the cube are the only “Platonic” solids that exist in any dimension, see [4]). In 
4D, the Case 13x3x3x3 WDT balls are hyperpoly hedra. 

Illustrating hyperpoly hedra is, however, not very easy. One way of doing this 
was presented in [5]. A 4D digital image is created, with a single object hyxel 
in the middle. The DT is then computed from this object into the background, 
in the standard way. If this image is thresholded at a suitable level, a ball with 
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the radius of the threshold value is created. The threshold should be as large 
as possible while the ball created is still completely within the image. We now 
have a binary 4D image containing the ball we wish to visualise. If we fix the re- 
level in this image, we will get a 3D image with a “hyperslice” of the 4D ball, 
which is in itself a polyhedron. This 3D object can be visualised using a simple 
binary 3D imaging technique. Ideally, the consecutive hyper slices can be shown 
as a sequence, a “movie,” but here we are constrained to show a few sample 
hyperslices. In Figs. 2 and 3, we show the (3, 4, 5, 6) and the (7, 10, 12, 13} balls 
with radius 46. The six “hyper slices” were chosen so that the different shapes 
the ball has at different levels are shown. They are not equally spaced in 4D, 
see the Figures for the chosen re-values, where re = 1 denotes the slice with the 
“first” ball hyxel. These two WDTs were chosen, as the (3, 4, 5, 6) is the one 
most probable to be used and the (7, 10, 12, 13) is the only one exhibiting all the 
faces that a general 4D Case I WDT can have. The “mid-slice”, w = 46, is a 
(3,4,5) ball and a (7, 10, 12) ball, respectively (see [3]). 

5 Conclusions 

Optimal weighted distance transforms in 4D using 3 x 3 x 3 x 3 neighbourhoods 
have been investigated. The best possible such DT, using real-valued weights, 
has a maximal difference from the Euclidean distance of 9.51%. The best possible 
integer valued DT is proven to have a maximal difference of 13.46%. The most 
useful WDT is probably (3, 4, 5, 6) with a maximal difference of 18.35%. This 
was, in fact, what was suggested already in [1], but there the motivation was 
much weaker. A number of other integer WDTs with higher scale factors, but 
with smaller maximal differences are also listed. 

Acknowledgement 

Thanks to Stina Svensson for computing and visualising the two balls in Figs 2 
and 3. 

References 

1. G. Borgefors, Distance transformations in arbitrary dimensions, Computer Vision, 
Graphics, and Image Processing 27 , 1984, pp. 321-345. 325, 326, 328, 333 

2. G. Borgefors, Distance transformations in digital images Computer Vision, Graph- 
ics, and Image Processing 34, 1986, pp. 344-371. 326 

3. G. Borgefors, On digital distance transforms in three dimensions. Computer Vision 
and Image Understanding, Vol. 64, No. 3, (1996), pp. 368-376. 326, 327, 332, 333 

4. H. S. M. Goxeter, Regular polytopes, Dover Publications, Inc., New York, 1973. 
332 

5. G. Borgefors, H. Guo, Weighted distance transform hyperspheres in four dimen- 
sions, Proc. SSAB Symposium on Image Analysis 1997, Stockholm, Sweden, March 
1997, pp. 71-76. 332 



334 Gunilla Borgefors 



6. M. Fidrich: Iso-surface extraction in 4D with applications related to scale space, 
In Miguet, Montanvert, Ubeda, Eds., Discrete Geometry for Gomputer Imagery, 
Springer 1996 (LNGS 1176), pp. 257-268. 325 

7. P. P. Jonker and O. Vermeij, On skeletonization in 4D images. In Perner, Wang, 
Rosenfeld, Eds., Advances in Structural and Syntactical Pattern Recognition, 
Springer 1996 (LNGS 1121), pp. 79-89. 325 

8. G. O. Kiselman, Regularity properties of distance transformations in image anal- 
ysis, Computer Vision and Image Understanding, Vol. 64 , No. 3, (1996), pp. 390- 
398. 325, 326, 327 

9. T. Y. Kong, Topology-preserving deletion of I’s from 2-, 3-, and 4-dimensional 
binary images. In Ahronovitz and Eiorio, Eds., Discrete Geometry for Gomputer 
Imagery, Springer 1997 (LNGS 1347), pp. 3-18. 325 

10. A. Rosenfeld and J. Pfaltz, Sequential operations in digital picture processing. 
Journal of the ACM 13 (4), 1966, pp. 471-494. 325 

11. M. Yamashita and T. Ibaraki: Distance defined by neighborhood sequences. Pattern 
Recognition 19 , 1986, pp. 237-246. 326 



Some Weighted Distance Transforms in Four Dimensions 



335 




Fig. 2. The ball of the (3, 4, 5, 6) distance transform, shown as 3D cuts through 
4D space at six different levels 
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Abstract. The paper describes the combination a multi- views approach 
to represent connected components of 2D binary images. The approach 
is based on the Object Connectivity Graph {OCG), which is a sub-graph 
of the connectivity graph generated by the Discrete Cylindrical Alge- 
braic Decomposition{DC AD) performed in the 2D discrete space. This 
construction allows us to find the number of connected components, to 
determine their connectivity degree, and to solve visibility problem. We 
show that the CAD construction, when performed on two orthogonal 
views, supply information to avoid ambiguities in the interpretation of 
each image component. The implementation of the algorithm is outlined 
and the computational complexities is given. 

Keywords: shape representation, shape decomposition, shape descrip- 
tion, digital topology 



1 Introduction 

The combination of multiple views of the same object can be used to improve its 
reconstruction. In principle, it is possible to recover the whole object informa- 
tion from partial ones; the number of views that are necessary to fully recover an 
object-representation depends also on the feature space. For examples in [1] two 
views are considered to reconstruct convex polyminoes; computerized tomogra- 
phy [2] is another example of reconstruction from projection. In [3] the symmetry 
transform, performed on multiple views, is used for object representation and 
classification. 

The paper describes how to combine the information of two orthogonal views 
to represent the connected components in 2D binary images. The combination 
method uses the Connectivity Graph (CG) as it is derived from the Cylindrical 
Algebraic Decomposition (CAD) introduced by Collins [4] for the decomposition 
of the Euclidean space, E^. In [5] the CAD construction has been extended to 
the analysis of components of a set of points of a 2D discrete space, D C V^, 
where N is the set of natural numbers and it has been applied for the solution 
of geometry problems of the first and the second order [6,7], to computer aided 
design, and shape analysis. 

The CAD-algorithm performs a cellular decomposition [8,9] of the projective 
plane in open ball, also named i-cells, with i > 0. For example in a 0- 
cell is a point, 1-cell is an open arc, and 2-cell is an open region. The cellular 
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decomposition of E‘^ may be defined as a nested sequence C C C E‘^ 
of closed subspaces, such that X^ consists of finitely many 0-cells, X^ consists 
of finitely many disjoint 1-cells, X^ consists of finitely many disjoint 2-cells. 

Let us describe the classical CAD-algorithm with an example. Let C be an 
algebraic curve in the real projective plane (see Fig. la): 

— 2xy^ — x^y^ + + 2x^y -h — 1 = 0 

then a cellular decomposition can be performed as in Fig. lb, by intersecting C 
with linear varieties. The decomposition is arbitrary, however after that it is 
performed the i-cells are determined, and they may be used to study C, and its 
connected components (< ( 2 ) — 1, where n is the degree of C). Two distinct cells 





Fig. 1. a) Algebraic curve C in the projective real plane; b) The Cylindrical 
Algebraic Decomposition 



are adjacent if they touch each other. Clearly adjacency is a symmetric relation. 
Therefore the CG nodes are cells, and the arcs represent the adjacency between 
cells. 

The extension of the CAD decomposition to a digital spaces is named Dis- 
crete CAD {DC AD) [5]. Its definition is straightforward by providing the proper 
definitions of i — cell, and connectivity relation on D [10,11]. 

The DC AD provides unambiguous topological information (connected com- 
ponents, holes); however, the structural description of a given component could 
be ambiguous. This ambiguity can be avoided by using two or more views. In the 
paper we show that two views are sufficient to avoid shape ambiguities that are 
on the component borders. An algorithm for combining two orthogonal views of 
the DC AD construction is also given and its computational complexity outlined. 

The paper is organized as follows. Section 2 describes the DCAD-algorithm. 
The algorithm to combine the CG is described in Section 3. Section 4 reports 
results and implementation notes. Final remarks are given in Sections. 
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2 The Discrete CAD Algorithm 

In the following the internal borders of connected components in will play 
the role of the algebraic curves. In this case the C AD-algoiithm consists in 
the partition of the space by means of parallel straight lines that intersect the 
borders of connected components. The i-cells, generated by the decomposition, 
are points (p), bend-points ( 5 ), open lines representing borders (6), open lines 
representing cylinders (c) and open 2D regions (r). 

Starting from this decomposition a labeled not oriented connectivity graph 
{CG) is derived, nodes of which are of type p, 5 , 6, c, and r. The arcs of a 
CG connect only nodes of different type that are adjacent. Fig.s 2a, b show an 
example of GAD^ and the corresponding CG in E^. An ordered pair of integer 
number, (L, i?), is assigned to each node of the GG; where L and R are named 
the left and right labels respectively. This labeling allows us to identify univocally 
nodes of type c, r, and 6, while nodes of type p and 5 have the same kind of pair, 
as stated below. 



R/L 


Odd 


Even 


Odd 

Even 


c 

p,s 


r 

b 



This labeling rule is determined by scanning the plane left to right (bottom 
to up) , and increasing by 1 L (R) each time a new element of the decomposition 
is intersected (initially L = R = 0). For example the region rl in Fig. 2a is labeled 
(0, 1), the line c2 is labeled (1,3) the line 61 is labeled (2, 2) and the point p2 is 
labeled (3, 2), see Fig. 2b. In E^ an infinity number of GAD^s can be performed; 
in fact, it is possible to consider all possible directions of straight lines. Here, 
only straight lines, parallel to the T(X)-axis that intersect concavity, convexity, 
and bend-points are selected. 

The T)CAT)-decomposition is obtained by considering as cylinders digital 
straight paths in D that are parallel to the y(X)-axis. Moreover, the cylinders 
selected cross the internal borders of the components in where one of the con- 
figurations of pixels shown in Fig. 3 is present. These configurations correspond 
to concavities, convexities, and bends of digital borders. 

The DGAD construction may generate empty regions between two adjacent 
cylinders. In the algorithm these regions are treated as virtual regions^ and their 
labeling is determined by considering the labels of adjacent cylinders. Each of 
these configurations became a single vertex in the construction of the GG. The 
labeling of CG is determined as in the continuous space, by scanning D left-right 
and bottom- up. The arcs represent the adjacency relation among points, paths 
and regions. 

Note that the 4-adjacency holds between points, paths, and internal regions; 
while 8-adjacency holds between points and external regions. 

2.1 Properties of the CG 

The GG allows to retrieve the connected components of binary images, and to 
derive information about the visibility-problem, that is stated as follows: for 
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Fig. 2. a) Example of CAD in b) corresponding Connectivity Graph of the 
CAD 

a given illumination of a scene, with more then one component, decide which 
component is totally or partially hidden. 

The CC derived is planar graph three colorable; its faces are triangles, nodes 
of which, represent points, bend-points, lines, and 2T)-regions. As said before, 
each node has three labels {X, L\, R\), with A G {p, 5,6, c, r}, that represent, in 
the order, its type, left, and right labels. 

Proposition 1. A simple cycle, C, ofCC, such that nodes of type p (s) (Lp(^s^ 
is odd and Rp(^s) even), are connected to nodes of type b (Lb and R^ are both 
even), represents an internal border of a connected component. 

The internal regions of a component of D are obtained as follows: 

Proposition 2. The internal region of a component, represented by the cycle C, 
is the union of the regions, such that their labeling (X, L\, R\), X G {p^s}, sat- 
isfies the condition: L is even and RmodA = 3 and are connected to some nodes 
of type p (s) of C. An analogous statement can be formulated for the exter- 
nal regions of a cycle (component) , by replacing the condition RmodA = 3 with 
RmodA = 1 . 
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Fig. 3. Basic configurations: left concavity {LC); right concavity {RC); right 
bend {RB) and left bend {LB); left double point {LTF); right double point 
{RTF); right bend of type 1 {RBI); left bend of type 1 {LBl); right bend of 
type 2 {RB2); left bend of type 2 {LB2) 



Proposition 1 and 2 are straightforward; they follow from the ordering that is 
settled in the nodes of CG, during the computation of the right and left labels. 

In the case of simply connected components the number of cycles is equal to 
the number of connected components in D. For example, the cycle (bold arcs) 
in Fig. 2b represent the connected components in Fig. 2a. Connected components 
may have A/^-holes {N > 0) and the number of components and their holes are 
retrieved by searching cycles in CG. In this case it is necessary to determine the 
relation of inclusion between cycles. The algorithm generate a graph is a forest of 
rooted oriented trees, nodes of which represents borders. Oriented arcs represent 
the relation to he directly included. Nodes of the forest represent a component 
iff it belongs to an even level. A node of an even level, with A^-sons, represents 
a component with A-holes. 

The inclusion relation is easily determined by comparing the highest, and 
lowest left/right labels of their nodes. 

Proposition 3. Given two simple cycles, Gi, Gj , ofGG, representing internal 
borders of connected components in D, then Gi C Gj iff the inequalities: 

Lm{Gi) > Lm{Gj) LM{Gi) < LM{Gj) 

and 

Rm{Gi) > Rm{Gj) RM{Gi) < RM{Gj) 

are all satisfied. Where Lm{Gi), Lm{Gj), LM{Gi), LM{Gj), Rm{G{), Rm{Gj), 
RM{Gi), RM{Gj) are the lowest and highest left/right labels of the nodes in Gi, 
and Gj . 

Here, we omitted the labeling information to light the notation. This propo- 
sition is easily proved by considering again the ordering of the nodes in GG, and 
by the fact that two cycles, representing regular components in D, cannot have 
common nodes. 
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3 Two Views Combination 

In the following we will consider the sub-graph of the CG, named Object CG 
ipCG)^ that is obtained deleting from the GG all nodes and the corresponding 
arcs representing components of the background of the digital scene (the value 0 
is assigned to the background by convention). The label of each node is preserved 
therefore all results, obtained in the previous section, still hold. The connected 
components of the OGG represent objects in the scene. The OGG allows us 
to store less space memory and makes easier the visualization of the results. 
Moreover, the A-label will not be explicitly shown in OGG. 

In the following we denote by DGADy(x) the DGAD along the y(X)-axis 
and by OGGy{x) the corresponding graphs. The DGADx is performed by scan- 
ning the image in a top down and left-right order (see Fig. 4). 





4 - 

T 



1 




Fig. 4. An example of DGADy and DGADx and related graphs 



The DGAD of a 2T) digital scene and the related OGG includes by its def- 
inition all topological information regarding the connected components in the 
scene. However, in the case on non-convex components structural information 
can be lost or could be ambiguous if we consider only one view. 

In the case of binary images most of the shape information is on the borders. 
The ambiguities, we want to eliminate, regard the positions and orientation of 
bend points and concavities (these one can be considered the combination of two 
bend points. 

For example, the DGADy of the left and right images in Fig. 5 generate 
the same OGGy (see Fig. 6), even if the bottom components have a different 
structure (the bend-points are in the opposite position). Note that the ordering 
of the A-labels remains the same in both configurations. 

Fig. 7 shows the DGADx of the previous images. The OGGx in Fig.s 8a, b 
corresponds to the left and right images in Fig. 7 respectively; note hat in this 
case the two OGGx^^ are different; because the ordering of the A-labels is not 
preserved. In the following we will show how to combine the two OCG’s to solve 
this ambiguity. 
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Fig. 5. DCADy of the left and right images 



s 




Fig. 6. The OCGy of the images in Fig. 5 



The following properties are used in the algorithm for combining the OCGy 
and the OCGx graphs. 

Proposition 4. A connected component of an OCG starts with at least one 
node with label (p,L,R) with L odd and R even, which is connected to nodes 
(A, Z/ T 1, R^f (A, Z/ T 1, Z^l T 1); (Z/ T 1, Z^l T 2) with Z^l ^ R even. 

This proposition follows from the scanning rules. 

Proposition 5. A sequence of nodes labeled (p,L,R), (c,L,R-\-l), {p, L, R-\-2) 
of a connected component of an OCG can be deleted with the arcs and the nodes 
on the left and right of the triple can be merged as follows: {(A, L — 1, x), (A, L + 
l,x)} — > {X, L — l,x) for X = R, R-\-l, R-^2. The reduced connected components 
maintain both topological and structural properties. 

Proof. In fact, the cylinder represented by the nodes (p,L,R), {c,L,R + 1), 
(p, L,R-\-2) makes a partition of a rectangle into two sub-rectangles that merged 
do not change neither the topology neither the structure of the image component. 
The components obtained are said normalized. 

In the following the sketch of the combining algorithm is given. The time 
complexity of each step is also evaluated as a function of the linear size n oi D 
and the number of components Nc in the image. 
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Fig. 7. DCADx of the same images 




P 



Fig. 8. a) The OCGx of the left image; b) the OCGx of the right image 



- Algorithm COMPOUND 

1 Input OCGy ^ OCGx 

2 Components association 

In this step the labeling, generated by the scanning rule, is used. Components 
in OCGx are selected by searching for starting nodes that satisfy Proposition 
4 that are yet marked and with the lowest labels L and R. The association 
is made with the corresponding not marked component in OCGy that has 
the highest Y spatial value. The associated components are marked. The 
association stops after that all components in OCGx and OCGy have been 
marked. The time complexity of this step is 0{Nc x Nc). 

3 Component normalization 

Using iteratively the reduction rule stated in Proposition 5 performs the 
normalization of the OCGy{X). The time complexity of this step is 0(n x 
Nc). 

4 Component combination 

The combination of the associated normalized components is performed by 
considering that for a given odd value of L > 1 the sequence of labels is of 
the form 



pc{scYp U pc{scYs U {scYp U (5c)*5 

Moreover, by using the 9 correspondences between the sequence OCGy and 
OCGx (see Fig. 9 and Table 1) it is possible to set the position of bend-points 
in each component eliminating the related ambiguities. 
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5 Output structural and shape information about each component 
- end COMPOUND 

The time complexity of this step is 0{n x Nc). 



t [? =D 

(a) <c) ( 4 > <«> 

(f) <S) (i> 



Fig. 9. The 9 correspondences between OCGy and OCGx 



Table 1. Words corresponding to configurations in Fig. 9 
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Config. 
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Fig.9a 
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Fig.9b 
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Fig.9c 


scp 
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Fig.9d 


scp 


scp 


Fig.9e 


scp 


scs 


Fig.Qf 


scs 


pcs 


Fig.Qg 


scs 


scp 


Fig.9h 


scs 


scs 


Fig.Qi 



Proposition 6. The combination phase of the algorithm COMPOUND elimi- 
nates ambiguities due to position and orientation of bend points. 

Proof For binary images, represented in a squared lattice, configurations, given 
in Fig. 9, cover all possible cases. Ambiguties are then solved by parsing left to 
right the words corresponding to the same object in OCGx and OCGy and 
testing the occurrencies given in Table 1. 



4 Implementation Notes 

The serial version of the T)(7AD-algorithm has been implemented in C++ under 
Linux. The whole computation time of the COMPOUND algorithm is, as we 
have seen, of the order 0{n x Nc). Note that in most applications Nc <C n. The 
computation time to perform the DC AD algorithm is of the order 0{n^) and it 
is the more expensive part of the whole computation. In [12] a parallel version 
of the T)(7AT)-algorithm is given. 

The CPU time to run DC AD and COMPOUND algorithms on a PEN- 
TIUM III 400MHz was of about Tjjcad — 0.5sec + Tcompound = 30msec for 
256 X 256 synthetic images containing at most 10 components. The maximum 
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M-’^\ “S4idE ( Jifu S— I th^ 

Fig. 10. The DC AD construction of a binary image 




OCG^ 

Fig. 11. The OCGx and OCGy of the binary image in Fig. 10 



number of nodes to be explored was 530 in the case of ten components. The 
right side of Fig. 10 shows the DC AD of a binary image on the left in both views 
(X, y). The top and the bottom of Fig. 11 show the corresponding OCGx and 
OCGy respectively. 

Either the discrete nature of D or the noise background may generate dummy 
bend- nodes. They have been removed by using heuristic arguments that are 
related to the image size; for example discard bend-points with length less than 
0.05 X n (e.g. in our case length< 10). 
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5 Final Remarks 

The paper shows a new approach to the analysis of connected components of 
binary images. The DC AD algorithm allows us to analyse topological properties 
of binary images. In the paper it is show how to combine the DC AD along two 
views in order to provide a structural description of digital images. The ordering 
of the OCG’s is at the basis of the search and combining algorithms, allowing 
them to be tractable and faster. The extension of the DC AD on 3D spaces is 
under study. 
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Abstract. The use of plane graphs for the description of image structure 
and shape representation poses two problems : (1) how to obtain the set 
of vertices, the set of edges and the incidence relation of the graph, and 
(2) how to embed the graph into the plane image. Initially, the image 
is represented by an embedded graph G in a straight forward manner, 

i.e. the edges of G represent the 4-connectivity of the pixels. Let G denote 
a (planar) abstract dual of G. Dual graph contraction is used to reduce 
the pair (G, G) to a pair (iL, H) of planar abstract duals. Dual graph 
contraction is unsymmetric due to an extra condition on the choice of 
the contraction kernels in G. This condition is shown to be necessary 
and sufRcient for H to be embedded onto G. The embedding is applied 
to the description of image structure and to shape representation. 



1 Introduction 

A key concept in combinatorial topology is the separation of topology (the graph) 
and embedding (of the graph) [Fra96]. In a hierarchical representation of an 
image by a sequence of plane graphs on increasing levels of abstraction and 
scale, however, the hierarchy should also be reflected by the embedding. Thus, 
the embedding of a lower level graph induces constraints on the embedding of 
higher level graphs. Let a high and a low level graph be denoted hy H = {Vh^ Eh) 
and L = {Vl, El) respectively. The hierarchy is reflected by the embedding of H 
and L (Fig. 1), if 

1. Vh C W, 

2. each embedded edge from Eh (as subset of the plane) is a union of embedded 
edges from El^ 

3. each region from the embedding of H (as subset of the plane) is a union of 
regions from the embedding of L. 

Note the transitivity of the above conditions. The high level graph is a con- 
nected topological minor of the low level graph [Die97]. This paper is devoted to 

* This work has been supported by the Austrian Science Fund (FWF) under grant 
S7002-MAT and grant P14445-MAT. 



G. Borgefors, I. Nystrom, and G. Sanniti di Baja (Eds.): DGCI 2000, LNCS 1953, pp. 348-357, 2000. 
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hierarchies of graphs as obtained by dual graph contraction [KM95]. It will be 
shown that an embedding of a higher level graph may always be derived from 
the embedding of a lower level graph such that the above conditions are fulfilled. 
Note, however, that dual graph contraction yields two hierarchies of graphs: on 
a planar graph and on its (planar) abstract dual. It will turn out that, in gen- 
eral, it is impossible to obtain a proper embedding of both hierarchies. Proper 
embedding of graph hierarchies is crucial for document image analysis [BK99] 
and the description of image structure [GEK99] . In this paper we propose a new 
application for dual graph contraction in which embedding is crucial, i.e. shape 
representation. The outline of the paper is as follows: In Section 2 dual graph 





Fig. 1. Hierarchical embedding, (a) Low level, (b) High level 



contraction is described in terms of equivalence relations. In Section 3 the em- 
bedding of high level graphs onto low level graphs is related to the unsymmetry 
of dual graph contraction. Section 4 demonstrates the use of the embedding for 
the description of image structure. Section 5 is devoted to shape representation 
by means of embedded graphs that describe the structure of distance transforms. 
We conclude in Section 6. 

2 Dual Graph Contraction by Equivalence Relations 

Throughout the paper we refer to the following definition of a graph. 

Definition 1 (Graph). A graph G = (V,E) is given by a finite set V of ele- 
ments ealled vertiees, a finite set E of elements ealled edges with E^V = ^ and 
an ineidenee relation l whieh assoeiates with eaeh edge e ^ E a subset of V with 
one or two elements. The vertiees in t{e) are ealled the end vertiees of e. 

Note that the definition includes graphs with self-loops (i.e. edges with only 
one end vertex) and multiple edges (i.e. several edges with identical sets of end 
vertices) . 
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Let {GjG) denote a pair of planar graphs, where G is an abstract 
dual [Die97] of G. We write G = (V,E) and G = (V^E). Dual edge eontrae- 
tion [Kro95] [KM95] of the pair (G, G) is specified by contraction kernels which 
form a spanning forest E of G. The result of dual edge contraction consists of a 
pair denoted by {G/E,G\ E), in which 

— G/E is obtained from G by contracting the edges of E as shown below. 

— E denotes the subgraph of G that is induced by the edges which are dual to 
the edges of E. 

— G\E is obtained from G by the removal of all edges from E. 

In the following, the vertex set, the edge set and the incidence relation of G/E 
are defined by means of an equivalence relation on the vertex set V oi G. 

The spanning forest E partitions V : each vertex belongs to exactly one con- 
nected component of E. Hence, the binary relation defined as 

V :<t^ V and w belong to the same connected component of T (1) 

is an equivalence relation on V. Let the equivalence class of v be denoted by 
[T]-p. The vertex set of G/E equals the quotient ofV by ^-p, defined as 

n -r-= {Nf \veV}. (2) 

If Ejt denotes the set of edges in T, the edge set of G/E equals E \ E^. The 
incidence relation of G/E is derived from the incidence relation of G in 
the following way: For a non-loop e G E \ E^ with ^^(e) = {T, w} set 

For a loop e ^ E \ E^ with ^^(e) = {F} set 

^G/F(e) := {Mp}- (4) 

The second step of dual graph contraction, i.e dual faee eontmetion, consists in 
the contraction of edges from the graph G\E and the removal of the correspond- 
ing dual edges in G/E. The contraction kernels in G \ T are to form a spanning 
forest E' of G \ E. However, each connected component of E' is required to 
contain at most one vertex whose degree in G \ F is larger than two. In the 
following, this constraint will be referred to as degree eonstraint. It assures that 
the removal of the corresponding dual edges in G/E is restricted to so called 
redundant edges^ i.e. parallel edges, or loops that do not surround a subgraph of 
G/F (Fig. 2). _ 

The result of dual graph contraction is a pair (FT, H) of planar graphs, in 
which H is an abstract dual of H. If E' denotes the subgraph of G that is induced 
by the edges which are dual to the edges in F', then 

H = {G\E) /E' and H = (G/F) \ F'. 



(5) 
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edge of G \ F 



edge of G / F 



edge of kernel 



I ) 

\ / 
\ / 

\ / 




(a) (b) 

Fig. 2. Dual face contraction, (a) Contraction kernel in G \ F. (b) Removal of 
redundant edges in G/F (dotted) 



The degree constraint ensures that the degree of a vertex [v]f' in H cannot 
exceed the degree of vertex v in G \ F. Furthermore, the degree of vertex v in 
G \ F is restricted by the degree of the vertex v in G. Hence, if the degree of 
vertex [v]f' in H is denoted by degH{[v]F')^ the following inequation holds: 

degH{[v]F') < degG\F(y) < degciy) \/v e V. (6) 

In particular, the maximal vertex degree in H is restricted by the maximal vertex 
degree in G. 

3 Embedding of Contracted Graphs 

Let G = (V,E) denote a plane graph and assume T to be a spanning forest of G 
that fulfills the degree constraint. An embedding of G/F can be obtained from 
the embedding of G in the following way (Fig. 3): 

1. Interpret the connected components of F as rooted trees: If the connected 
component has a (unique) vertex r, whose degree in G is larger than two, 
declare r to be the root. Otherwise declare any vertex of the connected 
component to be the root. 

2. Let R denote the set of all roots from step 1. For the vertex set of G/T, 
i.e. the quotient V\ ^f of R by ^f the following holds 

V\^F={[r]F\reR}. (7) 

For each r e R set the location of [r]i? to the location of r in G. 
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(a) (b) 

Fig. 3. (a) Graph G. Bold edges belong to spanning forest F. F fulfills the degree 
constraint. Roots are enlarged, (b) Embedding of G/F 



3. Let Ef denote the set of edges in F. The edge set of G/F equals E \ Ep- 
Let e be an edge from E\Ef and let the connected components of F, where 
the end vertices of e in G belong to, be denoted by GG\ and CC 2 . Since the 
roots ri and T 2 of GG\ and GG 2 are the only vertices in GG\ U GG 2 that 
may have a degree greater than two in G, there is a unique path 71 = 77(e) 
from ri to r 2 in G which contains e. Furthermore, for any two edges e, e' the 
paths 71(e) and Tl(e') may only intersect at vertices from 77. The embedding 
of e in G / F is given by the union of all (embedded) edges that belong to 
il(e). 

Note that the embedding of the paths, as described in step 3, fails whenever F 
does not fulfill the degree constraint. Hence, the degree constraint is necessary 
and sufficient for the graph G/F to be embedded onto G. 

4 Image Structure 

Gray level images may be interpreted as digital elevation models [DEM] , in which 
the altitudes are given by the gray levels. In [KD94] the structure of an image 
is defined via the crest lines of the corresponding DEM. In this section, a plane 
graph is constructed, which describes the crest lines in a DEM. It is referred to 
as crest graph. The crest graph for the gray level image in Fig. 4(a) is depicted 
in Fig. 5(a). In contrast to the graph constructed in [GEK99], the crest graph 
has the following properties: 

— The crest graph is embedded such that the edges describe the run of the 
crest lines in the DEM. 
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— The vertex set of the crest graph may include vertices that do not represent 
a hill of the DEM. The extra vertices represent branching points of crest 
lines that are not located on the top of hills. 




Fig. 4. (a) The gray levels of the pixels, (b) The pair (G, G) restricted to the 
square region marked in (a). The vertices of G [G] are depicted as squares [circles] 



The crest graph is constructed in the following way: 

1. The DEM is transformed into a pair {G,G) of attributed plane graphs, in 
which G and G are plane duals [Die97]. The pair (G, G) will later be trans- 
formed into a pair (Gc, Gc), in which Gc is the crest graph. An example of 
(G^G) is given in Fig. 4(b). The vertices of G represent the pixels centers, 
while the edges of G indicate the 4- neighborhood of the pixels. The vertex 
value is set to the altitude associated with the corresponding pixel. Through- 
out the transformation of (G, G) into (Gc, Gc), the value of an edge from G 
is to indicate the lowest altitudes along the embedded edge. Hence, the value 
of an edge e with end vertices u and v is initialized to the minimum of the 
vertex values of u and v. The vertices of G represent the regions of G and 
are located at the crossings of the pixel borders. The edges of G are straight 
line segments that reflect the 4-neighborhood of the regions from G. Their 
values are initialized to the values of the corresponding dual edges in G. A 
vertex T of G is initialized to the minimal value of all edges to surround the 
region (basin) represented by T, i.e. the minimal value of all edges in G that 
are incident to v . Throughout the construction of (Gc, Gc) from (G, G), the 
value of a vertex v from G is to indicate the lowest altitude in the region 
(basin) represented by v. 
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Fig. 5. The crest graph and the river line graph. The vertices which represent 
hills respectively basins are drawn larger 




Fig. 6. Overlay of the crest graph (thick lines) and the river line graph (thin 
lines) from Fig. 5 
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2. The pair {GcjGc) is generated from (G, G) by a sequence of monotonic dual 
graph contractions [GEK99]: Besides the general requirements described in 
Section 2, each contraction kernel of G [G] has to contain exactly one local 
maximum [minimum] of vertex values. The vertex value in the contracted 
graph is set to the maximum [minimum] of the vertex values in the corre- 
sponding contraction kernel. Monotonic dual graph contraction stops when 
no contraction kernel with more than one vertex can be chosen. 

3. The crest graph Gc is embedded into the plane as described in Section 3. 

If monotonic dual graph contraction is applied to the pair (G, G) instead of 
(G,G), the graph G will be transformed into a graph that describes the river 
lines of a DEM. The river line graph of Fig. 4(a) is depicted in Figure 5(b)). An 
overlay of the the crest graph and the river line graph is shown in Fig. 6. Note 
that, in general, the crest graph and the river line graph are not even abstract 
duals. However, embedding the two graphs into the DEM, two basins separated 
by a crest line are always connected by at least one river line that crosses the 
crest line at a local minimum (saddle point). 

The calculation of the crest graph was implemented in based on 

LED A [MN99]. In order to describe gray level edges in images, we calculated 
the crest graph on the image transformed by the Sobel operator (Fig. 7). Note 
that the crest graph is always connected. 




(a) (b) (c) 

Fig. 7. (a) Gray level image, (b) Input image for the calculation of the crest 
graph (Sobel operator on (a)), (c) Crest graph of (b). The widths of the edges 
correspond to the edge values, i.e. the altitudes of the saddle points 
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5 Shape from the Structure of Distance Transforms 

In this section the shape of a 4-connected set S of square pixels is represented by 
an embedded graph. The embedded graph is the crest graph calculated from a 
distance transform of 5, i.e. the gray values indicate the distance of the pixel to 
the outside of S. The calculation of the crest graph is done exactly as explained 
in the previous section. Note that this concept is not restricted to a special grid 
or to a special distance. 

As an example consider the 4-connected set of pixels in Fig. 8(b). The num- 
bers in Fig. 8(b) indicate the chamfer-3-4 distance [CM91] from the pixels to the 
outside. A sphere with respect to the chamfer-3-4 distance is shown in Fig. 8(a). 
The crest graph of the distance transform is depicted in Fig. 8(c). The advantages 
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Fig. 8. (a) Sphere with respect to the chamfer-3-4 distance, (b) Chamfer-3-4 
distance transform of an object, (c) Crest graph of (b) 



of the crest graph compared to the medial axis [Ser82] are as follows. 

— Whatever structuring elements are taken for the computation of the distance 
transform, the crest graph is always connected. 

— No pruning is needed [Ogn94]. 

— The structure of the crest graph, i.e. the crest graph without its embedding, 
yields a more compact and more abstract description of the shape. 

In contrast to the medial axis, shape representation by means of the crest graph 
is lossy. In particular, the crest graph does not represent narrowing dead ends. 
In the future we will try to overcome this drawback by a modification of the 
distance transform: the pixel values at the edge of the image are raised such 
that they form a new summit. The edges which connect this summit with the 
summits in the inner part of the image will run through the narrowing dead 
ends. 
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6 Conclusion 

The hierarchy of planar graphs, as obtained by dual graph contraction, has been 
combined with a hierarchical plane embedding of the graphs. It was shown that, 
in general, it is impossible to find such an embedding also for the corresponding 
hierarchy of the dual graphs. The proposed hierarchical embedding is very useful 
for the description of image structure by monotonic dual graph contraction and 
improves the method proposed in [GEK99]. If the image structure is computed 
on gray values coming from a distance transform of a binary image, monotonic 
dual graph contraction yields a skeleton-like plane graph. In contrast to the 
skeletons formed by pixels no pruning is necessary. Future work will focus on 
the proper representation of narrowing dead ends. 
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Abstract. Visibility partitions play an important role in computer vi- 
sion and pattern matching. This paper studies a new type of visibil- 
ity, re flection- visibility^ with applications in affine pattern matching: it 
is used in the definition of the reflection metric between two patterns 
consisting of line segments. This metric is affine invariant, and robust 
against noise, deformation, blurring, and cracks. We present algorithms 
that compute the refiection visibility partition in 0{{n-\-k) log(n)-hu) ran- 
domised time, where k is the number of visibility edges (at most 0(n^)), 
and V is the number of vertices in the partition (at most 0{n^ -\-k^)). We 
use this partition to compute the reflection metric in 0(r(riA + tib)) 
randomised time, for two line segment unions, with ua and ub line 
segments, separately, where r is the complexity of the overlay of two 
refiection- visibility partitions (at most 0{ua^ -\-tib^))- 



1 Introduction 

The visibility from a particular viewpoint in a pattern gives a local description 
of the pattern. The visibilities from all possible viewpoints give a complete rep- 
resentation of the pattern. This insight led to the use of visibility for pattern 
matching. Visibility has been used for object-recognition as early as 1982, see 
Chakravarty and Freeman [6]. Visibility is defined in terms of affine geometry, 
the concept does not depend on Euclidean distances. Therefore, visibility can be 
used as a tool for affine invariant shape recognition and affine pattern matching. 
We use a strong type of visibility to compute an affine invariant pattern metric, 
called the reflection metric. This metric is affine invariant, and robust against 
noise, deformation, blurring, and cracks, see Section 2. 

A well-studied structure related with visibility is the visibility graph. For 
a collection of n planar line segments, the visibility graph is the graph having 
the endpoints of the line segments as vertices, and edges between vertices for 
which the corresponding endpoints can be connected by an open line segment 
disjoint with all segments in the collection. A trivial algorithm, consisting of 
three nested loops, computes the visibility graph in 0{n^) time. Lee [18] was the 
first to improve this, by giving an 0(n^ log(n)) time algorithm. Optimal 0{n^) 
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time algorithms were found by Welzl [23], and Asano et al. [3]. If the number 
of visibility edges is /c, an algorithm by Pocchiola and Vegter [22] computes the 
visibility graph in 0(nlog(n) + k) time and 0{n) space. 

A plausible approach to using visibility for pattern matching would be visibil- 
ity graph recognition, see Ghosh [9], and Everett [8]. However, visibility graph 
recognition is a computationally expensive problem, see Lin and Skiena [19]. 
Moreover, visibility graphs depend heavily on the topology of the pattern, and 
are therefore by themselves not suitable for robust pattern matching. 

Our approach uses a special form of visibility, called reflection- visibility, for 
defining a similarity measure on line patterns. This measure, the reflection met- 
ric, is affine invariant by dehnition. That is, the distance between affine trans- 
formed patterns t{A) and t{B) equals the distance between the original A and B. 
The reflection metric is robust. It responds proportionally when lines are de- 
formed a little, slightly translated copies of existing lines are added, small cracks 
are made in lines, and small new lines are added. 

To compute the reflection metric we need to know the structure of the visi- 
bility partition corresponding to a special type of visibility. Visibility partitions 
consists of equivalence classes with constant combinatorial visibility. Plantinga 
and Dyer [21] call this structure the viewpoint space partition. The dual of the vis- 
ibility partition is called the aspect graphs see Kriegman and Ponce [17], Bowyer 
and Dyer [5], and Gigus et al. [11]. The number of possible views, the size of 
the visibility partition, was investigated, under varying assumptions, by Agar- 
wal and Sharir [1], and de Berg et al. [7]. For polygons, results about visibility 
partitions were found by Guibas et al. [13], Aronov et al. [2], and Bose et al. [4]. 

In this paper, we focus on the structure of visibility partitions as two-dimen- 
sional arrangements. We will investigate an alternative visibility partition, called 
reflection- visibility. As a start, we consider the standard visibility partition. We 
use an alternative way to define and describe it that is useful for the other types 
of visibility. After that, the analysis becomes more interesting as we proceed 
to the reflection- visibility partition. Let k be the number of visibility edges (at 
most quadratic in n) and v be the number of vertices in the partition (at most 
quadratic in n + /c). We present randomised algorithms that compute the parti- 
tions in 0{{n-\-k) log(n)-hu) time. We use this to compute the reflection distance. 
We assume a model of computation in which the absolute value of any rational 
function, a quotient of polynomials of degree at most d, can be integrated over 
any triangle in 0{d) time. Let A and B be unions of and ub segments, re- 
spectively. If the overlay of the two corresponding reflection-visibility partitions 
has complexity r, the reflection distance between A and B can be computed in 
0(r{nA + '^b)) time. 

2 The Reflection Metric 

The reflection metric, introduced in [16,14], defines a distance between finite 
unions of algebraic curve segments in the plane. If A and B are such unions, 
the reflection distance is denoted as dR(A, H). The reflection metric turns the 
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patterns A and B into functions pA, Pn - ^ after which the integrated 

absolute difference of these functions is normalised. The functions are defined 
using a strong form of visibility, called reflection-visibility. We defer the exact 
definition of the refiection metric until Section 5. 

Figure 1 shows a two-dimensional pattern A consisting of a finite number 
of straight line segments. Figure 2 shows the corresponding function evaluated 
on a discrete lattice, represented as a grey-scale image in which black corre- 
sponds with value 0. The example pattern is hieroglyphic ‘Al’ obtained from 
the hieroglyphics sign list, see [12]. 




Fig. 1. A straight line pattern 2; The function pA evaluated 

on a discrete lattice 



Let T be the group of affine transformations on The refiection metric is 
invariant for T, meaning that d^{t{A)A{B)) = d^{A^B) for any affine transfor- 
mation t G T. As a result, d^ can be used to construct a metric on affine shapes, 
patterns modulo affine transformation: 

Dr{T{A),T{B)) = Md^{t{A),B)^ 

The refiection metric is robust for various types of 
effects caused by discretisation and unreliable feature 
extraction. Slight deformations of patterns only in- 
crease or decrease distances slightly. Introducing blur, 
by adding new lines near existing lines in a pattern, 
only causes a proportional change in the refiection dis- 
tance. Making craeks in the interior of lines, splitting 
them up into multiple smaller ones, only changes the 
distance proportional to the length of these cracks. 

Adding noise in the form of new lines, far away from other lines, only increases 
the distance proportionally to the length of the added line. Figure 3 illustrates 
the effects of deformation, blur, cracks, and noise. The left pattern is the “orig- 
inal” , the right pattern is “affected” by the four types of distortion. 

In Section 5, we show that the refiection metric can be computed by con- 
structing and traversing an arrangement, which is the overlay of two reflection- 
visibility partitions. In the following sections, we characterise the visibility and 




Fig. 3. Deformation, 
blur, cracks and noise 
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reflection visibility partitions. We will give a randomised algorithm that is opti- 
mal in the number of segments n. 

3 Visibility Partitions 

Let S = { . . . , } be a collection of closed line segments and let P = 

{pi, • • • iPm} be the corresponding set of endpoints. In all that follows, we as- 
sume that the endpoints are in general position. For convenience, set A = IJ S. 
We say a point y G is visible from x G if the open line segment ~Py is 
disjoint with A. For any viewpoint x G define the visible part of A as the 
subset Vp^(x) C A given by: 



This set is sometimes called the visibility polygon of x in A, see [3]. The visibility 
star \stA{x) is the union of all open line segments connecting the viewpoint x 
with the visible part of A: 



Visibility stars are similar to view zones^ see [20] pp. 383-391. A view zone is 
a visibility star extended with all infinite rays from x disjoint with A. Figure 4 
shows a visibility star for an example consisting of eight line segments with 
thirteen distinct endpoints. The visible part Vp^(x) is drawn thick. The visibility 
star YstA{x) is the light grey region, including the dotted lines. 

Consider the endpoints and segments bounding the visibility star VstA(a:) 
ordered by slope with respect to x. This describes the structure of the visibility 
star. The visibility star is a finite union of triangles. Each triangle is an intersec- 
tion of three half-planes. Two of the half-planes are bounded by lines through x 
and a point in P. The third half-plane is bounded by the line through a segment 
of S. If a segment Si E S has a visible endpoint pj^ and x is collinear with Si, 
then the triangle “degenerates” to the open line segment x^. We are interested 
in the regions in the plane in which the structure of the visibility star is constant. 

We simplify the presentation by introducing an additional “line segment”. 
Let D be an open rectangle containing the union of segments A, and let Sq be its 
boundary. We will simply call *5^0 a segment. This gives an extended collection 
of segments S' = { 5o } U Let A' = Sq U A. Each ray starting from any point 
in D, intersects A'. 

We need a compact description of the structure of the visibility star \stA{x), 
for any viewpoint x G D. Eor this purpose, we define a collection of identifiers, 
referring either to segments or endpoints. An identifier is an (integer) index 
subscripted with a p or an s, indicating an endpoint or a segment, respectively. 
We order the identifiers linearly as follows: 0^ < < • • • < < • • • < rup. 

Each point a G A' is assigned an identifier id(a) as follows. If a = p^, then set 
id(a) = ip. If a G Si — P, then set id(a) = ig. 



Vp^(x) = {aGA|Anxa = 0}. 




aeVp^ix) 
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A 






Vpa(^) 



p! 



Fig. 4. A visibility star 



Fig. 5. A view map 



We represent the structure of the visibility star by a tuple identifiers. Choose 
any closed disc centred at x disjoint with A. The boundary of such a disc is 
called the view circle^ denoted by Cx^ We give each point c G a label /(c), 
an identifier, as follows. For each point a G A' visible from z, compute the 
intersection c G H xa and set /(c) = id(a). 

The view circle Cx is a disjoint union of inverse images /“^(c/), for each 
identifier d. Each non-empty inverse image /“^(d), the subset of points in Cx with 
label g/, can be decomposed into its connected components. These components 
are either single points or open arcs. The view map of x, denoted with Vmp^(x), 
is a labeled circuit graph whose vertices are the components with their constant 
labels. We call vertices labeled with endpoint identifiers (ip) p-vertiees. We 
call vertices labeled with segment identifiers (A) s-vertiees. Edges of Vmp^(x) 
are defined by pairs of vertices, constant-label components, with intersecting 
closures. Eigure 5 shows the labeled view circle, inducing the view map, for a 
collection of four closed line segments having nine distinct endpoints. Labels 
of p- vertices are indicated on the dotted lines on the outside of Cx^ Labels of s- 
vertices are indicated inside the view circle between successive dotted lines. 

The view map Vmp^(x) can be represented using a tuple of labels encoun- 
tered when traversing all edges, starting with some initial vertex and some in- 
cident edge. Of all possible tuples, the lexicographically smallest one represents 
the view map. This representation does not depend on the direction (clockwise 
or counter-clockwise) in which the labels occur on the view circle. We identify 
the view map with this unique tuple. In the situation of Eigure 5 this gives 



The view map Vmp^{x) is a function of points x G D. Define points x^y ^ D 
to be equivalent if their view maps (labeled graphs) are isomorphic, that is, 
Vmp^(x) = Vmp^(p). This equivalence relation results in a partition of D into 
equivalence classes. This visibility partition is denoted by Q^(<S). If x and y lie 
in the same class Q G Qv{<S) of the partition, the visibility stars YstA{x) and 
Vst A{y) have the same structure. 
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The visibility partition is affine invariant: the partition for the affine trans- 
formed set S equals the affine transformed partition (including the labels). The 
reflection- visibility partition also has this property. A single class in the visibil- 
ity partition can have more than one connected component. In the example of a 
single segment S = {Si} with endpoints P = {pi,P 2 }, the open half-planes left 
and right of Si are the connected components of the equivalence class in Qv{S) 
having view map (Og, Ip, 1^, 2^). 

The visibility partition has the structure of an arrangement induced by a 
finite union of closed line segments. Each cell in this arrangement is a connected 
component of an equivalence class in the partition. As the viewpoint x moves 
continuously within changes occur in Vmp^(x). Each time such a change 
occurs, the set of vertices visible from x changes. The sets of viewpoints x on 
which changes in the viewmap occur form one-dimensional boundaries in the 
arrangement describing the visibility partition. 

We construct a collection of “event segments” for the view map. Let Es be 
the collection of (directed) edges in the visibility graph. That is, Es consist of all 
pairs of endpoint-indices (i,j) such that pj is visible from pi. We extend Es to 
a collection E'^ by also including the endpoint-index pairs of each segment in S. 
Given an endpoint pi^ sort all endpoints pj^ , with (z, jk) G E'^^on clockwise angle. 
This results in a list of endpoint identifiers ji, . . . ,jc- Let Sk be the segment- 
identifier of the segment visible from pi inbetween the angles of pj^ and 
relative to pi (where A; + 1 is modulo c). We construct event segments bounding 
the set of points in D from which pi is visible. 

Eirst, we define the collection of event segments Vi. Eor each k = 1, . . . , c, 
we include in Vi the closure of the visible part of segment Ss, (visible from pi). 
This includes parts of the special segment with index Og. 

Second, we construct a segment collection Bi connecting pairs of segments 
in Vi. Eor each k = 1, . . . , c, we construct a closed segment between the two 
intersections of ray(p^, ) with segments in S'. If these two intersection coincide, 

we include no segment in Bi, for that particular k. 

The third and last types of segments are extensions of segments in S. Con- 
sider each segment S in S having endpoint pi, and having another endpoint pj. 
Include in A^, the closed segment having endpoints pi, and the intersection of 
Tdiy{pi,pj) with [jVi that is closest to pi. 

The three types of segments result in the arrangement describing the visibility 
partition. Let V, B, and A, denote the unions of Vi, Bi, and A^, over all i = 
1 , . . . ,m, respectively. 

Theorem 1. The boundaries in the visibility partition are formed by the event 
segments: Uq 6 Q„( 5 ) dQ = {jV . 

Eigure 6 shows the visibility partition for four line segments having seven distinct 
endpoints. The union of V coincides with A. The segments of B are drawn 
dashed. The segments of A are drawn coarse dashed. The points where event 
segments meet are indicated as dots. The rectangle containing the segments is 
the “segment” Sq. 
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For a derivation of the worst-case complexity of the visibility partition, 
see [15]. The visibility partition Qv{S) corresponding to S can be computed 
as follows. First, we compute the visibility graph for <S, in 0(nlog(n) + k) time 
using algorithms by Ghosh and Mount [10] or Pocchiola and Vegter [22]. Using 
the visibility graph, the view map can be computed for each endpoint. A simple 
algorithm discovers the segment visible between successive visibility edges inci- 
dent to each endpoint by performing log(n) iterations over the visibility edges. 
Include the segments of S as visibility edges. For each (directed) visibility edge 
e = (p,q), such that no segment adjacent to q is visible to the left of q rel- 
ative to p, store a pointer r(e) to the visibility edge incident to q that turns 
right relative to e. If a segment adjacent to q is visible to the left of q (relative 
to p), we store a pointer to this segment in r(e). In each iteration we consider 
all edges e in the visibility graph. If r(e) is not a segment blocking the view to 
the left we replace r(e) by r(r(e)). Analogous, we maintain pointers 1(e), where 
the roles of left and right exchange. After 0(log(A;)) = 0(log(n)) iterations, we 
have found the segments that block the view directly to the left and the right 
of each directed visibility edge. This takes a total of 0((n H- k) log(n)) time. Us- 
ing this information, we can generate the total collection of event segments in 
0{n-\-k) time. Let v be the number of intersections in the collection of event seg- 
ments thus generated. Using randomised incremental construction we construct 
a trapezoidal decomposition of this collection, see Mulmuley [20] pp. 84-94, in 
0{{n-\-k) \og{n)-\-v) time. The arrangement defined by the event segments can be 
obtained by merging together trapezoids into polygonal cells. Thus, the visibil- 
ity partition, represented as an arrangement, can be computed using randomised 
techniques in 0((n k) log(n) + t’). This results in the following theorem. 

Theorem 2. The visibility partition of n segments has worst- ease eomplexity 
0{n^). Using randomisation, it ean he eomputed in 0{{n -h k)\og{n) + v) time, 
where k = 0{n^) is the number of visibility edges, and v = 0(n^ -\- k‘^) is the 
number of vertiees in the arrangement. 

4 Reflection Partitions 

In this section, we consider a stronger notion of visibility, resulting in different 
stars and partitions. We say that a point p G is trans-visible from a point 
X G if p is visible from x and both ray(x, p) and ray(p, x) intersect A. The 
trans-visible part of A is given by: 



The trans-visibility starTstA{x) is the union of all open line segments between x 
and the trans-visible part of A: 



TPyi(a:) = {a G Vp^(a:) | Anray(a,a;) ^ 0}. 




aeTp^{x) 
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Fig. 6. A visibility partition ^ re&ection star 



Based on trans-visibility, we can define a trans-view map, and a trans- visibility 
partition, see [15]. 

We say a point ^ G is reflection-visible from a point x, if y is trans- visible 
from X and the open segment between y and the reflection of ^ in x is disjoint 
with A. Observe that in contrast with visibility and trans-visibility, reflection- 
visibility is not symmetric. Define the reflection-visible part of A as follows: 



Rp^(x) = {x -\-v e Tp^(x) I {x -\- v){x — v) = 0}. 

Define the reflection-visibility star as: 

RstA(a:) = U{ {x -h v){x — v) \ X -\- V ^ Rp^(x) }. 

The reflection-visibility star equals the intersection of a trans-visibility star 
TstA(a:) with its reflection around x. Figure 7 shows a reflection-visibility star 
for six disjoint segments. The grey area, including the dotted segments, forms 
the reflection-visiblity star. 

Consider a view circle Cx centred at x. We label the view circle to describe 
the structure of the reflect ion- visibility star at x. We define a labeling I of a 
view circle Cx, that represents the structure of the reflection visibility star. We 
use polar coordinates, such that each point c G is represented as c = (<a,r), 
where r is the radius of Cx- Let L{x^a) be the line through x and (<a,r). Let 
e > 0 be smaller than the angle between any two endpoints. There are three 
cases: 

1. If L{x^a) intersects a visible point of 5 q, then /(c) = 0^. 

2. If L(z, a) intersects a visible endpoint p e P and L(x, a — e) or L(x, a -\- e) 
intersects a visible point of 5 q, then /(c) = id(p). 

3. In all other cases, set /(c) = id(a), where a is the visible point in AnL(x, a) 
closest to X. 
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Overlaps in the rules are resolved by choosing the minimum- index identifier. 

The labeling I defines a labeled circuit graph called the reflection view map^ 
denoted by Rmp^(x). Figure 8 shows the view circle along with the labels of the 
reflection- view map. The reflection- visible part is shown thick. The dashed lines 
are reflections of segments in the view point. 




Fig. 8. A reflection- view map 



P7 




Fig. 9. A reflection-visibility parti- 
tion 



The reflection- view map Rmp 5 (x) at a point x contains the structure of the 
reflection- visibility star RstA(a:) at a given point. Starting at some vertex and 
some initial edge, we obtain a tuple of labels representing the reflection-view 
map. Since this tuple repeats itself, we only take the first half. We choose the 
lexicographically smallest half-tuple as a unique representation of Rmp^(x). In 
the situation of Figure 8 this gives Rmp 5 (x) = (0^, 1^, 1^, 2^, 0^, 3^, 4^, 2^). 

We obtain a reflection-visibility partition by identifying points x^y G D if 
their reflect ion- view maps Rmp^(x) and Rmp^(^) are equal. When moving x, 
if the reflect ion- view map changes, then the set of reflection- visible endpoints 
(relative to x) changes: if we move x, either an endpoint identifier, a segment 
identifier, or an intersection identifier, appears or disappears. In case of an end- 
point identifier, the result is immediate. In case of a segment identifier, a change 
occurs only if one endpoint starts to occlude another one. If an intersection iden- 
tifier disappears it is replaced by an endpoint identifier. Since a change in the 
set of endpoints also implies a change in the reflection view map, we can con- 
clude the following. Each class in the reflection-visibility partition is a maximal 
connected subset of D in which a fixed set of endpoints is reflect ion- visible. 

We construct a collection of event segments for the reflection- view map. Con- 
sider an endpoint pi. Consider Vi as for visibility. Replace each segment in Vi 
that lies in Sq by a “degenerate” segment consisting of the point pi. Now build Bi 
and Ai using Vi just as in the visibility case. 

Consider a scaling transformation / that leaves pi fixed and which has a 
scaling factor of 1/2. That is, all coordinates relative to pi are multiplied by 
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1/2. To obtain the desired reflection-view map, replace all segments in Bi 
and Xi by their images under /. Like before, we construct unions of the segment 
collections over z = 1, . . . , m. 

Theorem 3. The boundaries in the re fleetion- visibility partition are formed by 
the event segments: UgeQrC^S) = \JV U\JBU\JX. 

Figure 9 shows a reflection-visibility partition. The complexity of the 
reflection- visibility partition is at most O(n^). For the i7(n^) worst-case lower 
bound, see [15]. To compute the reflection- visibility, we can use the same basic 
techniques to compute the normal visibility partition. 

Theorem 4. The re fleetion- visibility partition of n segments has worst-ease 
eomplexity 0(nf). Using randomisation, it ean be eomputed in 0{{n-\-k) log(n) + 
'L’) time, where k = 0{n^) is the number of visibility edges, and v = 0(n^ -h k‘^) 
is the number of vertiees in the arrangement. 

5 Computing the Reflection Metric 

As an application of the previously defined structures, we use them in computing 
the reflection metric. Let pa{x) be the area of the reflection star RstA(a:) for each 
z G see Figure 2 for an example. Observe that for points x outside the convex 
hull of A, this area is always zero. If we have two finite unions of line segments A 
and B, the reflection metric da is defined as: 

, / . m ^ /r2 \pa(x) - pb{x)\ dx 

^ ’ J^2^a.x{pA{x),pB{x))dx' 

The reflection metric can be generalised to finite complexes of d — 1 dimensional 
algebraic hyper-surface patches in d dimensions. For this, we refer to [16]. Here, 
we focus at the computation of the reflection metric for finite unions of segments 
in the plane. 

In Section 2, we emphasised the fact that the reflection metric is robust 
for deformation, blur, cracks, and noise. This property can be derived from the 
definition without much difficulty. Given a fixed viewpoint, the change in the 
area of the visibility star caused by each of the above effects is proportional. 
The area of the reflection- visibility star, pa{x), changes at most twice as much 
as the area of the visibility star. This pointwise behaviour of the p^-function is 
preserved as we integrate it, showing robustness of the reflection metric. 

Now, we apply the results from the previous sections to compute the reflection 
metric. We assume A and B are unions of ua and ns line segments, respectively, 
having at most k edges in their visibility graphs. The reflection metric can be 
rewritten as follows: 



2/r2 \r{x)\ dx 

Ir 2 \p{x)\ dx + \q(x)\ dx + 4 ^ \r{x)\ dx" 



dK{A,B) 
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where p{x) = Pa{x)^ q{x) = Pb{x)^ and r{x) = p{x) — q{x). The functions p, q 
and r are piecewise rational functions in two variables. With piecewise we mean 
that there is a finite number of triangles covering the support of the function, 
such that the restriction of the function to each such triangle is a rational func- 
tion in two variables. The functions p and q are quotients of polynomials of 
degrees 0(riA) and 0{nB)j respectively. The function r is a quotient of polyno- 
mials having degree 0{ua ub)- We adopt a model of computation in which 
the absolute value of a rational function in two variables, can be integrated 
over a triangular domain in 0{d) time, where d is the maximum degree of the 
polynomial numerator and denominator. 

The computation of the integrals of the rational functions p, q and r proceeds 
as follows. Let kA and kB denote the number of visibility edges corresponding 
to A and B respectively. First, we compute the visibility graphs of A and 5, 
taking times 0{nA^og{nA) + kA) and 0{nB^og{riB) + kB), respectively. Using 
the algorithm sketched in Section 3, the event segments that correspond to the 
reflection visibility partition, can be found in 0{sa \og{riA)) and 0{sb \og{riB)) 
time, where = ^{kA) and sb = ^{kB) are the number of event segments 
for A and 5, respectively. Then, we compute a trapezoidal decomposition for the 
union of both event segment collections in time 0{{sa sb) log{nA + + '^), 

where v is the number of intersections. We integrate the absolute values of p, q, 
and r by summing the partial integrals over all trapezoids (each trapezoid is a 
union of two triangles). In our model of computation, this takes 0{nA tlb) 
time for each trapezoid. Since the summation of partial integrals dominates the 
overall complexity, we arrive at the following result. 

Theorem 5. Let A and B each he unions of ua and bb segments, respec- 
tively. Using randomisation, the reflection distance d^{A,B) can he computed 
in 0{r{nA~\-nB)) time, where r is the complexity of the overlay of the reflection- 
visihility partitions of A and B. 

6 Conclusion 

We presented a new metric for pattern matching, the reflection metric. This 
metric is invariant under the group affine transformations and can therefore be 
used for affine shape recognition. The reflection metric is robust for pattern- 
defects such as deformation, blur, cracks, and noise. It can be generalised to 
finite unions of algebraic hyper-surface patches in any dimension. 

The reflection metric is defined in terms of reflection visibility. Trans- visibility 
and reflection- visibility are stronger than visibility. Reflection- visible points are 
always trans-visible, and trans-visible points are always visible. We analysed 
the partitions corresponding to the visibilities, starting at normal visibility, and 
proceeding with reflection- visibility. New types of events emerged, making the 
resulting partitions more complex. 

Constructions show that the worst-case combinatorial complexity for each 
of the types of partitions is Q(rA). Using randomised incremental construction, 
each of the corresponding arrangements can be built in 0{{n -h k)\og{n) -h v) 
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time, where k is the number of visibility edges, and v is the number of in- 
tersections in the arrangement. The structure of reflection- visibility partitions 
can be used to compute the reflection metric for two collections of segments in 
0(r(riA -\-tib)) randomised time, where r is the complexity of the overlay of two 
reflection-visibility partitions. Limiting the endpoints of segments to 1? would 
not change the complexity of the partitions, since any finite arrangement in 
can be transformed into an arrangement in 1? . Limiting the segment slopes to 
set of allowed slopes does not affect the worst-case complexity of the visibil- 
ity partition, but does affect the complexity of computing the metric, since the 
maximal polynomial degree of the piecewise function p is linear in the number 
of different segment slopes. 
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Abstract. This paper deseribes a praetieal approaeh to mathematieal 
morphology and ways to implement its operations. The first ehapters 
treat a formalism that has the potential of implementing morphologieal 
operations on binary images of arbitrary dimensions. The formalism is 
based on sets of strueturing elements for hit-or-miss transforms whereas 
eaeh strueturing element aetually deseribes a shape primitive. The for- 
malism is applied to two and three dimensional binary images and the 
paper ineludes strueturing elements for topology preserving thinning or 
skeletonization and various skeleton variants. The generation of shape 
primitive deteeting masks is treated as well as their applieation in seg- 
mentation, aeeurate measurement and eonditions for topology preserv- 
ing. The formalism is expanded to four-dimensional images and elabo- 
rates on the extension of 3D skeletonization to 4D skeletonization. A 
short exeursion was made to methods based on 3D and 4D Euler - elus- 
ter eount methods. 



1 Introduction 

In this paper a practical introduction to Mathematical Morphology applied on binary 
images of arbitrary dimensions is presented. Mathematical Morphology applied on 
binary images is also known as Cellular Logic Processing. Whereas the Mathematical 
Morphology sprouts from a mathematical domain. Cellular Logic Processing has its 
roots in binary image processing on massively parallel Cellular Logic Machines. The 
approaches are, however, similar and have their own charm. The method and formal- 
ism as described have been developed in parallel with the design of an architecture for 
real-time image processing [1]. However, the method is (also) implemented in soft- 
ware. In the next chapter, the use of sets of structuring elements is explained and op- 
erations in 2D and 3D. In chapter three, the connectivity paradox and topology 
breaking properties are studied for N dimensional images, followed by skeletonization 
using shape primitives. The generation and verification of shape primitives is ex- 
plained, and in chapter four, their extension to the fourth dimension. 
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2 Sets of Structuring Elements 

2.1 Hit-or-Miss Transformations 

The field of eellular logie image proeessing and mathematieal morphology is exten- 
sively deseribed in several works (Golay 1969 [2]; Preston 1970 [3]; Serra 1982 [4], 
1988 [5], Giardina 1988 [6], Heijmans 1994 [7], Soille 1999 [8], and many others). A 
basie operation in mathematieal morphology is the Hit-or-Miss transformation, and as 
a starting point for this seetion we will quote its definition by Serra (1982): 

The Hit-or-Miss transformation is a point-by-point transformation of a 
set X, that is performed in the following way: 

Choose and fix a structuring element S, the datum of S being two sets 
s'" and . Suppose S is centered at the point x of X, denoted by 
S = , then a point belongs to the Hit-or-Miss transformation 

Y ^ X ® S ofX,if and only if is included in X and is included 
in the complement X"" ofX, or: 

To show the similarities, this definition in the mathematieal domain will be mirrored 
to the domain of binary image processing with its convolution kernel-like approach, 
by associating an origin and a vector structure with it. 

We associate with the universe -the union of X and W -, a square tessellated image 
X of size m X n, and with the elements of the universe, the pixels of image X. We can 
then associate with the set X, the foreground of image X and with the set X"" its back- 
ground. Likewise for the sets Y and Y"" , and image Y. As structuring element S con- 
sists of two disjunct sets and S'^ , then its universe is formed by the union of the 
disjunct sub-sets S'" , S'^ and S'" , S'" being the complement of the union of and S'^ . 
We associate with the union of , S'^ and S'" , a square tessellated image S of size i x 
j, and with the elements of , S'^ and S'" , the pixels in the image S. Then we can asso- 
ciate with the set S'" the foreground pixels, with the set S'^ the background pixels, with 
the union of S'" and the do-care pixels and with the set S'" the donT care pixels. In 
our case of binary image processing, the foreground is defined to have the value 
TRUE or 1, the background to have the value FALSE or 0, and the don’t cares to have 
the value D. Hence, don’t cares are used to implement the unrestricted set S with im- 
age S having a fixed size and shape. 

Let X, y and s be the elements (pixels) of X, Y and S, and let the size of S e.g. be i = 
j = 3, then the 3 x 3 structuring element S consists of s°,s\...s^ the east (E), north-east 

(NE), north (N), , south-east (SE) neighbours of the central (C) pixel s^ itself, 

and an equivalent 3x3 neighbourhood M,^ around pixel x,^ , consists of x° , xj^....xj : 
the E, NE, N, ,SE neighbours of the pixel x,^ and x,^ the central (C) pixel Xj^ itself 
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The point-by-point transformation Y ^ X<S)S can now be implemented for binary 
image processing with the neighbourhood transformation Y ^ X = S , or 
{Vk:yj^ ^ = S} with and S centered around pixel , and with = the symbol 

for an inexact match. Informally, the inexact neighbourhood matches 
{Vk:yj^ ^ = S} can be described as follows: 



If for any pixel x,^ in an input image X, its neighbourhood M,^ matches inexactly 
with a structuring element S, the pixel of output image Y is set to one. If does- 

n 7 match S, y,^ is set to zero. 



In the inexact neighbourhood match the foreground pixels in S should match with 
foreground pixels in M,^ at the same positions AND the background pixels in S should 
match with background pixels in M,^ at the same positions, whereas in the don’t care 
positions of S a match is not required, or: 




Fig. 1. Structuring element S and its implementation with an image S with fixed size and shape 



We now extend this inexact neighbourhood match in two ways, first to: 



( 3 ) 



or: If for any pixel Xj^ in an input image X, its neighbourhood matches one 

mask Sf from a set of masks the pixel y^ of output image Y is set to one, else to 
zero. 

Meaning that the union of all mask matches is taken. 



And further to: 



Yk^ 







( 4 ) 
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or: If for any pixel in an input image X, its neighbourhood matches any 

mask Sf from a given set of masks the pixel of output image Y is set to one, 
else set to zero, OR if its neighbourhood matches any mask Sf from a given set of 
masks , the pixel is set to zero, else to one. 



The image transformation Y ^ X = S is now implemented with a set of masks S as 
strueturing element, eonsisting of a subset (the SET-masks) and a subset (the 
RESET-masks), one of whieh may be empty. This means that either a pixel y,^ is set 
to zero, if one of the RESET masks fits, or the pixel is set to one, if one of the SET 
mask fits, where the SET masks are ehosen to dominate over the RESET masks. 

Finally, a seeond input image Z ean be used to loeally enable/disable the transfor- 
mation, yielding the dyadie operation Y ^ /({X, Z} = S) , or: 







(5) 



If a "mask-bit" z of a mask of the set S is set don’t care, the transformation is enabled. 
If z is do-care, then if the pixel of Z matches with z , the operation is disabled, 

else enabled. Operations that use Z to locally mask-off or seed operations, are the 
propagation operation and the anchor-skeleton. 

Note that the term structuring element is used in broad sense. We will also use the 
word kernel, mask or mask set (all indicated with S) to indicate a structuring element. 
In many cases, a mask set can be transformed into a set with a single mask, and vice 
versa, without effecting its functional properties. This will be demonstrated in the 
sequel. A theoretical treatment of the manipulation of sets of structuring elements, 
finding the minimal set (the basis) and deriving the equivalent set that is spanned by 
scanning an image with a sequence of structuring elements, can be found e.g. in the 
article of Jones and Svalbe (1994) [9], which starts from Matheron’s theorem 
(1975) [10] that "a simple algebra of union intersection and translation suffices to 
implement all general translation invariant and increasing set mappings". 

An operation on an image X can be done by performing the Hit-or-Miss transfor- 
mation with a mask set S once (6), twice or more (7), or infinite (8: the n-pass form): 

Y ^ (X = S) (6) 



Y^((((X = S) = S) = S).... = S) 



(7) 



Xo^X; {Vi:X.,,^(X,sS)];Y^X^ (8) 



In the latter case an iterative scheme is used, which can be implemented by trans- 
forming the image until it is idempotent under the transformation. 
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In many cases practical use can be made of the intermediate results in the output 
image. This is called (spatial) recursion. For instance, if the transformation is per- 
formed by a raster scan over the image, i.e. from top-left to bottom-right, the fact that 
the neighbours NE, N, NW, W of the pixel yj^ have already obtained a new value can 

be utilized. For this purpose, the neighbourhood Mj^ of pixel will be extended by 
Yk ? Yk ? Yk ? Yk ? the newly obtained values and the structuring element S will be ex- 
tended by: 




Fig. 2. Drawing conventions for Y ^ /({X, Y, Z} = S) Here shown for 2D images and 3 x 

3 neighbourhoods. The LN is taken from input image X, the RN from output image Y. For 
dyadic operations a second input image Z is used 

So now both the Local Neighbourhood (LN: the original value Xj^ and its neighbour- 
hood x^,xj,,x^,....x^ ), and the Recursive Neighbourhood (RN: the newly obtained 
values y^\ yf , yf , yf ), can be used in the neighbourhood matching procedure. The 
class of cellular logic operations performed with this extended neighbourhood will be 
referred to as recursive neighbourhood operations (RNO) in contrast with the local 
neighbourhood operations (LNO) that use the normal neighbourhood only. Note that 
both classes have a dyadic form for which S is extended with an element z and 
with a pixel of a second image Z, yielding: 

Y^/({X,Y,Z}sS) W 

Figure 2 pictures the matching process with a single mask from a set S, whereas the 
neighbourhood M,^ is extracted from the three different images. Note that the Recur- 
sive Neighbourhood that can be realised is implementation dependent, and that when 
using a software raster scan over the image, it is beneficial for RNOs to scan in all odd 
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scans from top-left to bottom-right and in all even scans from bottom-right to top-left. 
In this case, the RN should be transposed to match with the scan direction. 

2.2 Basic Morphological Operations in 

The most frequently used known Loeal Neighbourhood Operations are: Erosion, Di- 
lation, Contour extraetion. Spot noise removal (pepper and salt removal) and Majority 
vote (binary rank filtering). The two most frequently used reeursive and iterative eel- 
lular logic operations are: Conneeted eomponent extraetion or Propagation and To- 
pology Preserving Thinning or Skeletonisation. Figures 3 and 4 show some strueturing 
elements in the form of mask-sets for simple operations in X 2 and X 3 . 



SET SET SET SET 




Copy Move NE Erode 8 eet Erode 4cet 



SET SET SET SET SET 




Copy Move NE Erode 26cct Erode 1 8’cct Erode 6 ccl 
Fig. 3. Structuring elements for simple operations in X 2 and X 3 

The structuring elements in Figure 3 are LNOs based on single masks of the SET type, 
meaning that the eentral element is set to foreground if the 3^ neighbourhood matches, 
else to baekground. 

The operations for X 2 are: copy the image, move the image to North-East, erode an 
8 conneeted contour from all objects in the image, erode a 4 eonneeted contour from 
all objects in the image. Note that the "erode 8 eet mask" is in faet a template that 
matehes on foreground area in X 2 , it does neither match on background nor places 
where it touches the contour of an object in any of the directions N,E,S,W. Erode 4cet 
does the same but in addition will not match on pixels in the NE, NW, SE, SW diree- 
tions. Because of the template matching aspect of the masks, the principle can be 
expanded to X^^ as the masks for X 3 show. 

Figure 4 shows the dilation operation: It comprises a mask that matches on back- 
ground area in X 2 and on baekground volume inX 3 while not touching objects. Where 
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the mask matches, the background is reset to background, where the mask does not 
match, the background is set to foreground. From the contour detection mask set in 
X 2 and X 3 the first mask matches on foreground not touching the object contour, 
which is reset to background, and the second mask matches on all background which 
is also reset to background. Both masks do not match on the object contours, which 
will hence be set to foreground. A result is shown for X 3 , where on a cubic volume 
with a ball shaped hole, de 26 connected contour surface was detected using the mask- 
set ’’detect 26cct”. Note that for a better view of the result, the cube was cut open short 
behind its front face. Finally, the mask-set for Pepper and Salt removal for X 2 shows 
that the first mask repairs the pepper and the second mask repairs the salt noise. 



RESET RESET RESET RESET SET 




Dilate 4cct Detect Sect Spot noise removal 



RESET 




RESET RESET 




O 



V 



■V 






Dilate 26cct Detect 26cct 




Fig. 4. RESET masks, masks sets and mixed SET / RESET sets. Opaque elements are don’t 
cares, light grey are background elements and dark grey are foreground elements 

The relation between the SET and RESET forms of the mask-sets is given by the 
Boolean algebra, i.e. the theorem of De Morgan (1847)[11]. Defining mask sets using 
only SET conditions is possible, although in some cases this is not very parsimonious. 
Eor example, the transformation of dilate_4cct from a conjunctive canonical form to a 
disjunctive canonical form is given by: 

(CaNaWaSaE) = (CvNv WvSvE)^ 
(CaN)v(CaW)v(CaS)v(CaE)v(C) 

Dilate_4cct, first expressed as the ’’erosion of the background”, expands to a set of 
five masks, expressing the dilation as: ’’copy the image in directions N, W, S, E and 
take the union of all results and original image”, and finally to a set of 5 masks that 
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indicate that "any background pixel adjacent to the foreground is set to foreground”. 
The discussion above leads us to the observation, that mask sets can be minimized by 
transforming them from the disjunctive canonical form to the conjunctive canonical 
form, or vice versa. And, that mask-sets can be manipulated using Boolean algebra 
and minimisation. 

Figure 5 shows an example of a dyadic Recursive Neighbourhood Operation in 
X2 and X3 ; the propagation operation. Objects in an image are recursively dilated 
(the first mask), wherever foreground in image Z is found and background in X was 
found (the second mask). 



RESET RESET 



□ ■ 



N. 



■V 



Propagation Sect 



J 



RESET RESET 





] ^ 


^ — 7 






^ 7 




1 ( 


^ 7 






0 






7 






^ — 7 






0 




^ L 


7 



Propagation 26cct 



Fig. 5. Dyadic RN Operation in X 2 and X 3 ; the propagation operation 



2.3 Connectivities in N Dimensional Images X^^ 

A set is connected if each pair of its point can be joined by a path along points that are 
all in the set. Within a square tessellated two-dimensional binary image X2 with ob- 
jects on a background, there are two possibilities for the connectivity of object and 
background pixels. The objects can be chosen to consist of pixels connected with one 
or more of their 8 neighbours at (E, NE, N, NW, W, SW, S, SE) or they can be chosen 
to be 4-connected, with one or more of their neighbours at (E, N, W, S). See also 
Rosenfeldt and Pfaltz (1966) [12] for the connectivity paradox. This paradox extends 
to higher dimensions. Note that we define the connectivity of an object as the highest 
connectivity between any two adjacent elements within that object. 

Tessellation in higher dimensions is also known as honeycombing. Coxeter [13] 
proves in his book on regular polytopes, that the only regular honeycomb (a single 
regular identical cell on a lattice) that exist in all dimensions N is the cubic honey- 
comb. In addition in 2D the tessellation with hexagons exist and in 4D two types of 
honeycombs exist (each-others dual), however, not with their vertices on a lattice. 
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Although in 3D the Dodecahedron and Icosahedron exits, they cannot be used for 
honeycombing in 3D. C60 molecules or "bucky-balls" are truncated Icosahedrons, 
made from hexagons and pentagons. Above dimension 5 only the equivalents of the 
regular Tetrahedron, Octahedron and Cube exist as polytopes. Which leaves hypercu- 
bic honeycombing as the only way to extend our method to higher dimensions. 

Let us define an N-dimensional binary image as a bounded section of an equi- 
distantly sampled Euclidean space of dimension N, with element values {0,1}. The 
elements of X2 are referred to as pixels, the elements of X3 as voxels, of X^ and 

higher as hyper-voxels. Let be an N-dimensional hyper-cubic neighbourhood 
with odd size n = 2k+l. For many morphological operations 3^ neighbourhoods are 
used (k=l, n=3), so is abbreviated to M2 for a 3x3 and M3 for a 3 x 3 x 3 neigh- 
bourhood, etc. Connectivities can be derived by counting the number of image ele- 
ments around a central element, within a hypersphere and within the neighbourhood. 
E.g. in X2 there are 4 pixels within radius 1 and 8 pixels within radius V2 . In [1] and 
[14] it was shown that the connectivity between image elements can be derived by 
counting E, the number of elements on a hyper sphere with radius d around an image 

element that lie within the neighbourhood M” . E can be calculated by permuting the 
neighbourhood element co-ordinates, and is given by: 



E= 



N! 

k 



2(N-no) 



n(v) 



( 11 ) 



From E, we can derive V, the total number of elements within the hyper-sphere within 
M^ and hence the connectivity G = V-1. This yields as connectivities in a 

3^ neighbourhood for X2 : {4,8}, for X3 : {6, 18, 26}, for X4 : {8, 32, 64, 80}, etc., 
in a 5^ neighbourhood for X2 : {12, 20, 24}, for X3 : {32, 56, 80, 92, 116, 124}, etc. 



2.4 Connectivity Paradox in X 3 

Connectivity rests on connected paths between pairs of points in a set. The paradox 
exists because with the sort of operations we do on the sets (like segmenting regions 
into objects) we demand that paths are boundaries. I.e. they cannot be crossed -nor 
tunneled- without breaking the path. In topology, the Jordan Curve theorem states that 
a simple closed curve lying in a plane divides the plane into precisely two regions and 
forms their common boundary. [15]. This theorem extends to X^ . 
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Fig. 6. Jordan Curves applied in X2 and X3 



Consider a set of 4 points from a larger universe X 2 with two sets eaeh of two points 
(white and grey), see figure 6 . We will call this 2x2 neighbourhood a tile. Clearly 
when the grey set forms anywhere in the universe a closed boundary, incorporating the 
connection between the two grey points of the neighborhood, the white set cannot be 
connected. When we define that in a neighbourhood all points of a set directly adja- 
cent to the boundary of the neighbourhood are assumed to be connected to any other 
point of the same set over that boundary, then for figure 6 b this means that if the grey 
set is connected, the white set cannot be connected. If we assume closure over the tile 
boundary, we can say that two points each at another side of a boundary cannot be 
connected. So we must choose, either white is face and edge connected and grey is 
only face connected, or vice versa. 

We can extend this to higher dimensions, but then tiles are 2^ neighbourhoods and 
closed boundaries are topologically equivalent to (hyper) spheres. Note that we con- 
sider connections over the tile boundaries now separately for each dimension. Figures 
6 c and d shows tiles in X 3 . Figure 6 c is a topological equivalent of a sphere when we 
assume that the grey set is 26 connected and the white set is not. The white set must 
then be 6 connected. Figure 6 d is a topological equivalent of a sphere when we as- 
sume that the grey set is 1 8 connected and the white set not. The white set must then 
be 6 connected, while we don’t care about the vertex connections. Vice versa, when 
we are sure that the white set is 6 connected, we might as well say that the grey set is 
18 connected, as we don’t care about the vertex connections. We can now make the 
observations that: 

• The lowest (face) connectivity in any image X^ is always a perfect boundary, it 
never leaks, all higher connectivities do. The reason is that a pair of face connected 
elements has unit extends perpendicular to its connection axis in all remaining di- 
mensions. 

• Usually we are interested in the properties of the foreground objects, the curvature 
of objects being one of them. The higher the connectivity, the smaller radii can be 
made, as can be imagined by observing the radii of 8 and 4 connected curves in X 2 

(1, V 2 ) and 26, 18 and 6 connected rings in X 3 (1, V 2 , Vs ). If we assume that 
that the centers of the image elements represent sampling points of a physical phe- 
nomenon, then, with the same sampling density, smaller structures can be repre- 
sented more accurately using a higher connectivity. This gives a preference for 
taking the highest possible connectivity as the foreground connectivity. 
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Consequently; A good choice is to take the lowest connectivity as background as it 
never leaks and the highest connectivity as foreground as it provides us with the 
highest bending radius. 



3 Skeletonization in 

In the previous seetions, we have deseribed a general prineiple for morphologieal 
operations on eubie tessellated binary images , based on the seanning of the image 
with a set of masks that eomprise the strueturing element. The masks eontain fore- 
ground, baekground and don’t-eares. On eaeh element of the image an inexaet mateh 
between all masks of the set and the neighbourhood extraeted from the image is per- 
formed, whereas the result written to the output image is the union of all matehes. 

So far, only erosion of foreground and baekground and matehes on single elements 
(pixels, voxels, ...) have been used. Erosions ean be deseribed as a mateh on fore- 
ground area in X2 and on foreground volume in X3 . In this ehapter, skeletonization in 
X^ is treated. Skeletonization ean be seen as eonditional erosion [ 16 ]. Figure 7 shows 
that in X3 a volume is eroded to a curved surface -the surface skeleton-, where after 
the surface is eroded to a space curve -the curve skeleton-. Apparently, for skeletoni- 
zation the condition for the erosion is that surfaces, or curves, should not be eroded. 
For those conditions, we need the notion of shape primitives in X^ . 





Fig. 7. Original, contour, surface-, and curve skeleton of a eube with hole in X 3 



3.1 Shape Primitives in X^^ 

In X^ we can identify shape primitives with a certain intrinsic dimension N . For 
example, in X2 we can distinguish points ( N =0), curves ( N = 1) and surfaces ( N 
= 2 ), in X3 points, space curves, curved surfaces and flat volumes ( N = 3 ), and in 
X4 we can distinguish points, space curves, curved surfaces, curved volumes and flat 
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hyper- volumes (N =4). Objeets in ean now be eonsidered as made of any arbi- 
trary eombination of shape primitives with intrinsie dimensions (0 < N < .N). 

Figure 7 shows objeets only eonsisting of volume, or surfaee, or spaee eurve. It was 
already remarked that the ( 6 , 18', 26 eonneeted) erosion eonditions for X 3 mateh on 
volumes exeept for the outer boundary of those volumes. Consequently, they are the 
shape primitive deteetors for volumes in X 3 . The eontour deteetion operations "De- 
teet 6 eet", "Deteet 18'eet", and "Deteet 26eet" were applied on a hollow volume (ob- 
tained by dilating a single voxel with 26 eonneeted surfaee eontours obtaining a 
sphere, and XOR-ing this with a eube). See figure 8 . 




Fig. 8. 6, 18' and 26 connected surface contours in X3 

(The cube's front slices have been removed for better insight). All resulting surfaces 
prevent the leaking of 6 connected background. The first result shows a surface that is 
entirely made of face connected voxels, which prevents leaking of face, edge and 
vertex connected voxels. Therefore, the background is allowed to be 26 connected. 
The second result shows a surface that is entirely made of face connected voxels, 
which prevents leaking of face and edge connected background but allows leaking of 
vertex connected background. Therefore, the background is allowed to be 18 con- 
nected. The third result shows a surface that is entirely made of face and edge con- 
nected voxels, which allows leaking of edge and vertex connected background. So the 
background is should be face ( 6 ) connected. 

In order to detect primitive shapes such as space curves, curved surfaces and vol- 
umes, we need shape primitives that match on all possible incarnations of those primi- 
tives shapes. 

3.2 The Generation of Shape Primitives in X 3 

Shape primitives for space curves can be found by remarking that an open (non circu- 
lar) space curve in a 3^ neighbourhood consists of a central voxel and two mutually 
non touching neighbours. Figure 9 shows all possible configurations for 6 and 26 
connected space curves generated by this rule. 
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Fig. 9. The 6 connected space curve primitives and the 26 connected space curve primitives 



Note that we omitted all rotated and mirrored versions and that non touehing is de- 
fined for a eertain eonneetivity. Henee, the seeond mask from the 6 eonneeted set is 
not present in the 26 eonneeted set. The set of 1 8 eonneeted spaee eurves is a subset of 
the 26 eonneeted set. 












Fig. 10. The 6 Connected eurved surfaee primitives 



In a similar way, the surfaee primitives ean be found. They ean be generated by a 
eentral voxel eneireled by a non-touehing elosed spaee eurve. For a 6 eonneeted sur- 
faee set, the eentral voxel should be eneireled by a 6 eonneeted spaee eurve (figure 
10), for a 26 eonneeted set by a 26 eonneeted spaee eurve (figure 1 1). 






















Fig. 11. The 26 connected curved surface primitives 



Again non touching is defined for a certain connectivity. The generation procedure is 
a recursive search that generates all possible occasions. After this, all identical, mir- 
rored and rotated variants are removed and the set is logically reduced to a minimal 
covering set [17]. Note when generating the surface primitives, their voxels stay in the 
18 connected neighbourhood of the central voxel in both cases. The vertex connected 
positions of the central voxel are apparently don’t care. Observe, that in the 6BG - 
26FG system, the space curves roam over the 26 positions, the curved surfaces over 
the 1 8 connected positions, and the voxels of the volume detection mask are found on 
the 6 connected positions. Consequently, we can state that in a 6BG-26FG system the 
space curves are 26 connected, the surfaces are 18 connected and the volumes are 6 
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connected. In a 26BG-6FG system the space eurves are 6 conneeted, the surfaces 18 
eonneeted and the volumes 26 eonneeted. 

If we have eompound (bifureating) objeets, such as forking space eurves, splitting 
surfaees, surfaees with sprouting curves ete., we must introduee don’t eares. We ean 
do this, because we ean be more preeise in our foreground-baekground paradox. We 
ean state this paradox now as follows: 

1. A foreground curved surface is threatened to be pierced by a background space 
curve. 

2. A foreground space-curve is threatened to be broken by a background surface, 
breaking the connectivity of the curve. 

We ean now merge the sets. Rule 1 leads to figure 12, a merging of the sets from fig- 
ures 11 and the first set of figure 9. Rule 2 leads to figure 13, a merging of the sets 
from figure 10 and the seeond set of figure 9. The set from figure 12 (when augmented 
with rotated and mirrored variants) ean be used to identify any surfaee in an image 
X 3 . The set of figure 13 can be used to identify any curve in an image X 3 . For all the 
masks hold, that if the central voxel swaps its value, e.g. from foreground to baek- 
ground, the eonnectivity of the foreground is broken and the topology changes. As 
each mask hits individually, it can be tagged with a contribution to the length of a 
eurve or to the area of a surfaee. Hence, surface area and eurve length’s of objects can 
be measured more accurately then simply counting voxels. [18]. 




Fig. 12. Shape detectors for curved surfaces "Surf26' 
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3.3 Erosion with Shape Primitives as Conditions: Skeletonization 

The main application of the shape detectors is that they can be used as conditions for 
topology preserving erosion, or skeletonization. The method of chapter 2, iterating 
over an image X 3 with the erosion mask ”Erode26ccf’ of figure 3 and the sets of 
figure 12 (Surf26) and 13 (Curv26), yields a skeleton. Erode26cct erodes surfaces 
from volumes (it hits only on the core of the volumes, not on their boundaries). The 
set Curv26 prevents the erosion of surfaces (the masks only hit on the core of the 
surface, not on the surface boundaries). The set Surf26 prevents the erosion of curves 
(the masks only hit on the kernel of the curves). Consequently, as volumes, surfaces 
and curves cannot be eroded from their kernels, they are eroded from their boundaries. 
So only closed surfaces and curves will remain. 

To prevent the erosion of boundaries, from the set Curv26 a new set Curv26e is 
made that contains all surface edge situations. Similarly, a set Curve26e can be made 
from the set CURV26 containing all curve end situations. A simple procedure can be 
followed. A surface extends in 4 directions. Systematically one and two directions can 
be disabled by setting them to background. A curve extends in two directions, one of 
which can be disabled by setting it to background. This yields 87 unique configura- 
tions for the surface edges and 3 possibilities for curve ends. 

A final remark should be made on the detection of two element thick structures. 
They cannot be detected properly in a 3^ neighbourhood. To circumvent this, the 
recursive neighbourhood must be used to detect foreground-background changes. 
However, only for the Surf 26 and Curv26 sets and not for the erosion mask and the 
Surf26e and Curv26e masks! 

A variety in skeletons can now be made (see figures 14 and 15) 
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Fig. 14. OriginaP, surface skeleton, curve skeleton and skeleton without curve ends 

The set {Erode26eet, Surf26, Surf26e and Curv26} was used to obtain the surfaee 
skeleton. The set {Erode26eet, Surf26, Curv26 and Curv26e} was used to obtain the 
eurve skeleton. The set {Erode26eet, Surf26, Curv26} was used to obtain the last 
skeleton of figure 14. All masks from all sets ean be used in parallel. However, the 
loeal situation in the image determines whieh masks are aetive. E.g. if the image does 
not eontain voluminous and surfaee objeets anymore, only the Curv26 set is aetive, 
preventing the breaking of eurves that are eroded from their ends. 

An important observation is that the masks that prevent the breaking of the topol- 
ogy for a eertain intrinsie dimension N , are the erosion conditions for objects with 
that intrinsic dimension N . Consequently, the mask Erode26cct prevents breaking the 
topology of a (flat) volume and the set Surf26 incorporate the erosion conditions and 
metric (!) for the curved surfaces. These metrics, due to the mask Erode26cct and set 
Surf26, are not Euclidean, which results in deviations of the skeleton from the medial 
axes of the original object. When the skeleton is needed for measurement purposes, 
the sets Surf26, Surf26e, Curv26, Curv26e cannot be used in parallel, but must be 
applied sequentially in the following way. A constrained Euclidean distance transform 
[19] should be performed on the original volume, where-after the surface skeleton 
should be made, eroding voxels in an order that depends on their distance to the vol- 
ume boundary. Subsequently, on the surface skeleton, a constrained distance trans- 
form should be made, after which the curve skeleton should be made eroding voxels in 
the order of their distance to the surface edge. 

Figure 15 shows the use of a curve skeleton {Erode26cct, Surf26, Curv26} in path- 
finding through a maze. Start and endpoint are inserted in image 7 } and using an 
extra mask anchor the skeleton onto those two points. 

For Robot Soccer [20] a similar anchor skeleton was used in the "Dutch Team" to 
quickly plan the collision free path from start to goal of an attacker robot. The im- 
age X 3 represented the universe of the robot with two place dimensions (x, y) and one 
time dimension (t). Manipulating both the erosion metric and the mask sets made it 
possible to prevent the time from running backwards and setting a speed maximum for 
the robot. If the orientation of the robot should be taken into account also, the problem 
should be solved in X 4 (x, y, 0 , t). 



^ The real original image (from Toriwaki) eontained two holes, giving rise to two elosed sur- 
faees at the skeleton near the wrist. We elosed the holes beforehand, for didaetieal reasons. 
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Fig. 15. An anchor skeleton is used to solve a path finding problem 

3.4 Verification 

The method deseribed in this paper lends itself very well for extension into higher 
dimensions. Provided the approach is valid. This was verified by comparing the out- 
come of the masks for X 2 with the Hilditch [21] skeleton and outcome of the masks 
for X 3 with the approaches of Lobregt[22] / Toriwaki [23] and Malandain [24]. They 
were checked by comparing and found to have an identical outcome in all 2 ^ and 2 ^^ 
possibilities of the 3^ and 3^ neighbourhood. Note, however, that in all three methods 
only the curve skeleton could directly be made, as the conditions can not be split over 
surface and a space curve sets. Hence, no accurate Euclidean variants could be estab- 
lished with those methods. 

To prepare for verification in X 4 we extended the Euler count (Eobregt) / cluster 
count (Toriwaki) method to 4D [25]. As the Euler-sum[26] in 4D is always zero, we 
solved this by embedding 4D objects in X 5 to calculate the 5D Euler number. This 

number changes due to 3 events: ID tunnels (N =1), 2D tunnels (N =2) and 3D tun- 
nels ( N =3). As the latter cannot occur in X 4 , with Euler-, and fore- and background 
cluster counting, the number of equations just equals the number of unknowns and a 
topology change can be detected. As the Euler sum applied to objects in X 5 counts 
also the possible 3D tunnels, fore- and background cluster counting is now not enough 
to detect the cause of change in the Euler sum. Consequently, as fore- and background 
cluster counting is all we can do in any dimension, the Euler/cluster count approach is 
only applicable in X 3 and (using X 5 ) in X 4 , making X 4 the limit. 

We generated test images in X 4 based on circular closed objects of various intrin- 
sic dimension, such as 4D equivalents of hollow balls, circles, etc. and contaminated 
them with foreground noise. The 4D skeleton showed to be robust. 
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4 Shape Primitives in X 4 

The methods deseribed in this paper extend straightforward to , espeeially for the 
basie operations sueh as erosion, dilation, eontour extraetion, propagation ete. in- 
volving only a few masks. To get an impression how the interaetion between fore- 
ground and baekground shapes primitives and their intrinsie dimension works out in 
X 4 we show in figure 16 how the paradox of foreground and baekground shape 
primitives works out. We define: 

Topology breaking of a foreground shape primitive of dimension N is done by a 
background shape primitive of intrinsic dimension N-N (and vice versa). 

For example, a spaee eurve eannot be broken by a point. As we only have foreground 
and baekground, the baekground point must be eonneeted to other baekground. This 
eannot be a line, beeause the foreground eurve leaves two degrees of freedom e.q. 
intrinsie dimensions open. For a simplified view on the matter in X 4 , we assumed in 
figure 16 lowest connectivity for fore- and background and the existence of flat ob- 
jects only. Figure 16 shows five sets of shape primitives: Hyper-volume threatened to 
be broken by a point, volume threatened by a line, plane threatened by a plane, line 
threatened by a volume an point threatened by a hyper-volume. It indicates that be- 
cause we consider the topology breaking as the union of a number of individual 
events, we do not run into the problem of the Euler-Cluster count method, that cannot 
make a distinction between the occurrences that lead to a change in the Euler number. 

On top of the paradox of the shape primitives we find the connectivity paradox. 
Table 1 indicates the relation between a dimension, its shape primitives and their 
associated connectivities. We distinguish CE: Connectivity to the central Element, 
NN: number of neighbours required for this shape primitive, RC: Recursive connec- 
tivity. 

The generation of the various sets follows the same lines as was done for X 3 . A 
shape primitive of intrinsic dimension N is made by encircling the central element 
over the positions CE, by shape primitives of intrinsic dimension N -1 and connec- 
tivity RC. This is a recursive procedure. E.g. a foreground hyper-volume is made by 
encircling it by a closed foreground surface, etc. 
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5 Conclusions 

In this paper we deseribed a framework for mathematieal morphology or eellular logie 
operations that ean in prineiple be extended to N dimensional (hyper) eubie tessellated 
images. We foeussed on the eonstruetions of shape primitives that ean be used to 
deteet shape properties loeally in the image. Shape primitives are grouped in intrinsie 
dimension sets, sueh as volume, surfaee and eurve primitives. The shape properties 
ean be used to identify parts (e.g. all surfaee edges) and henee help to segment the 
image. By tagging eaeh primitive with a eertain eontribution, measurements that are 
more aeeurate ean be done. Finally, they ean be used as topology breaking eonditions 
for the generation of skeletons. Beeause they are the union of a sub-set of eonditions, 
this enables the smooth ereation of various skeleton variants. The number of shape 
primitives is remarkably low in X 3 . A skeleton of a 200^ image takes about 5 minutes. 

The method is extendable to higher dimensions. This is straightforward for simple 
morphologieal operations, however, not simple for shape primitives. A method was 
presented to automatieally generate the shape primitives in X 3 and its extension to 
X 4 was indieated. The verifieation of the generated X 4 primitives eomparing it with a 
skeleton for X^ based on an Euler eount / eluster count method, is ongoing research 
[27]. 
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Abstract. The aim of this paper is to present a new algorithm to cal- 
culate the confinement tree of an image - also known as component tree 
or dendrone - for which we can prove that its worst-case complexity is 
0{n log n) where n is the number of pixels. More precisely, in a first part, 
we present an algorithm which separates the different kinds of operations 

- which we call scanning, fusion, propagation, and attribute operations 

- such that we can separately apply complexity analysis on them and 
such that all operations except propagation stay in 0(n). The imple- 
mentation of the propagation operations is presented in a second part, 
first in O(n^), where Un is the number of nodes in the tree (rin < n). 
This is sufficient if the number of pixels is much larger than the number 
of nodes (rin << n). Else, we show how to obtain 0(rin log rin) com- 
plexity for propagation. We construct two example images to investigate 
the behavior of two known algorithms for which we can show worst-case 
complexity of 0{n^ logn) and O(n^), respectively, and we compare it to 
our algorithm. Finally, a practical evaluation will be opposed to the the- 
oretical results. Several variations of the implementation will show which 
operations are time consuming in practice. 



1 Introduction 

In the last years, the confinement tree proved to be a useful tool in image 
processing. This summary of an image in which each node corresponds to an 
image region (called a confiner) has been (re) discovered by several authors 
according to various applications in image processing, but its origins lie in 
statistics [14,6]. Applications have been image filtering, segmentation [5, 4, 3, 7], 
matching [10,9,12,11], and classification [10], object detection [5] and recogni- 
tion [10,12]. Structures similar to the confinement tree are investigated in [13]. 
Note that especially when we are concerned with 3D imaging the time to calcu- 
late the tree (or the trees for each of the 2D slices forming the 3D image) becomes 
prohibitive. For 3D reconstruction in X-ray audio-graph imaging we have to align 
(match) up to 1000 2D image slices and to calculate for each one, as well as for 
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the whole 3D image, the confinement tree for image matching [11]. Fast exe- 
cution time is also required for segmentation of cell regions in high-throughput 
scanning [15]. 

In this paper we want to present and investigate the difficulties that ap- 
pear when looking for an efficient implementation of the confinement tree and 
to propose possible solutions. There is a need for such investigations because 
even if for some applications very fast execution times are required, in a first 
approach, people focus more on the ability of the algorithm to solve the stud- 
ied problem than on its optimization. For instance, Guillataud [4] proposed an 
implementation which determines the confiners successively and independently 
at each level and [5,3] made only suggestions for an algorithm which “may be in 
almost-linear time” [3] based on immersion simulation. Jones [7,8] made a first 
attempt to consider also the optimization problem. He obtained in general much 
faster execution times but for certain images (as shown on theoretical examples 
in section 3) we can considerably improve the fastness. In worst-case we can 
improve the complexity by passing from 0(n^ logn) to O(nlogn). Salembier et 
al. [13] presented already before an efficient algorithm to calculate the max-tree 
which is a tree related to the confinement tree (see section 3). We apply similar 
basic techniques as Jones in [8] but our global strategy is different. In addition 
we will give more details (sections 3.3 and 3.2) and provide a complexity analysis 
in section 4. Our experimental evaluation in section 5 will show where to achieve 
more efficiency in practice. Before presenting our algorithm in section 3 we give 
first definitions and notations in section 2. 

2 Definitions and Notations 




node (2,1) 




U 



( 0 , 1 ) 



(b) 



Fig. 1. Illustration after [4] for (a) the definition of the confiners and (b) the 
confinement tree 



Given an intensity function / : ^ M>o,d = 2,3, the confiners are 

defined as the maximal connected subsets (i.e., components) Ci of the level 
sets Cl = {x e M^|/(x) > /},/ F M>o (see Figure 1(a)). Gonsidering them 
taken on several levels lk,k = (r from resolution) including the 0 level. 
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they define obviously a tree (by “set inclusion”; see Figure 1(b)) called confine- 
ment tree. In practice, the levels are those of the image’s grey intensity function 
and we take all available grey levels into account. If we do not represent identical 
confiners in the tree but only the highest among them (see Figure 2) we call this 
the unique representation. We choose the discrete dA — distance for defining the 
connectivity between pixels (see [7] for illustrations). 

We found these components (confiners) and tree first in the classification 
domain in a paper by Wishart [14] and further investigations by Hartigan [6], 
where the confiners are called high density clusters. In image processing these 
sets and this structure appear first in [5]. See [14,5,10,11] for the invariance 
properties of the confinement tree. Another kind of components, induced in a 
natural way by the grey level function, are the connected components of the sets 
{x G M^|/(z) = /} [13]. They are called flat regions. We will use throughout this 
paper the notations n for the number of pixels in the image, for that of nodes 
in the tree, and r for the number of grey levels. We assume r < n. 

3 Implementation 

When designing an efficient algorithm implementing the confinement tree, there 
arise some difficulties from the necessity to scan at each grey level only pixels 
bringing a new knowledge about the tree and to propagate information about 
connectivity of regions in the image. Hereby, we will distinguish in the follow- 
ing attribute operations^ applied to update the tree attributes, and connectivity 
operations which are necessary to find the connection from a pixel to a confiner 
and to establish the father-son relationship in the tree. They include scanning^ 
fusion^ and propagation operations^ the latter are needed to propagate neighbor- 
ship information through the already updated part of the tree. 
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Fig. 2. Example images: (a) for worst case analysis for the algorithms of Jones 
[8] and Salembier et al. [13]; (b) for comparison with our algorithm. On the cor- 
responding confinement trees we have chosen unique representation of identical 
confiners. We assume 4-connectivity 
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Algorithms to calculate confiners for a given grey level in linear time with 
respect to n are well known [1]. However, when we look for an algorithm which 
calculates the confiners for all grey levels but stays within 0{n) attribute op- 
erations and O(nlogn) connectivity operations there arise some complications 
making it worth to be detailed. 

A first attempt at designing an efficient algorithm tackling this problem was 
made by Jones [7,8]. His algorithm calculates first all local maxima of the image’s 
grey level function (i.e., the leaves of the tree). After it detects, starting from 
each local maxima in sequential order, the confiners at the different grey levels 
on the path (called branch [8]) between the leaf and the root which are not 
already updated. For each leaf m the km pixels of these confiners are updated 
using a priority data queue in which they are ordered according to their grey 
value. In Figure 2(a) this will be done, for a given leaf m, in 0(km log km) time. 
The pixels associated to the leaf at level 25 are delimited by the continuous 25- 
line. Pixels in not updated confiners on the branch between leaf 24 and the root 
are confined by the 24-line and the longer part of the 25-line, etc. Therefore, if 
- as in Figure 2 - many isolated local maxima with high grey level appear in 
the image the algorithm examines multiple times large parts of it, even if is 
small as depicted in Figure 2(b) (see also section 5). The worst case behavior 
(Figure 2(a)) is 0(n^ logn). If we try to overcome the problem by stopping this 
detection process for a given leaf as soon as the grey levels increase we have to 
pay attention to the fact that in this case two higher branches (as 23 and 24 
in 2(a)) can be separated by lower ones (as 19 and 22). We would need procedures 
as we will propose in steps (B2 1) and (B3 /). 

Another approach for calculating the confinement tree would be to add at- 
tribute operations to the max-tree creation procedure and then to derive the 
confinement tree from the max-tree. Salembier et al. [13] presented an elegant 
algorithm to calculate the max-tree and achieved very fast results in practice. 
Applied to the image of Figure 2(a) however, their algorithm runs in 0{n^) 
time. As their paper did not address principally to the optimization problem, a 
detailed evaluation is not given. 

In the following, we describe first globally the algorithm, after we give detailed 
information for procedures implementing it, as presented in the Figures 4, 5, 6. 



3.1 Global Strategy 

The discrete connectivity model we have chosen is that of 4-connectivity (for 
illustration see [7]). The pseudo-code used below is based on ANSI-C. 



Data Structure of the Result to be Determined Our algorithm aims to 
calculate the components of the following data structure: for each confiner (node) 
we keep in a structure (struct) (1) its surface, (2) its mass, (3) the X and the 
Y coordinates of its gravity center, (4) the eastmost among its northmost pixels 
(to be able to recalculate easily its contour), and (5) a number for identifying 
the node of its father in the array Tree defined next. The tree is implemented 
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then as a 2 dimensional (dynamic) array Tree. It associates to a given level I 
(in the tree) and to the given number z of a node at this level the structure of 
the corresponding confiner (node) Cu. Cu is also at level I in the image F (in 
the procedures we will use the notation L for the level). In an additional array 
NodeNumb we keep the number of nodes at the different levels. In the description 
of our implementation we will first represent identical confiners (see Figure 3(c), 
confiner 4 at the levels 10, 11, 12) at each new level again. After, we will show 
what is to change when representing only the confiner at highest level among 
identical confiners (“unique representation”). This is necessary for worst case 
analysis. 



Principal Steps of the Algorithm The algorithm proceeds in two steps: first 
(A) the pixels are sorted in decreasing order according to their grey level. The 
order at the same level is arbitrary. This is done by a standard sort algorithm 
called “counting sort” [2] in 0(n + r) = 0{n) (we assume r <n) time. In the sec- 
ond step (B) we calculate the confiners for all grey levels I (step B /) in decreasing 
order, beginning with the highest grey level, and we update the tree structure 
up to the given level. Step (B 1) consists of four sub- steps (B1 /), ..., (B4 /). Step 
(B1 /) (Figures 3(a), 4) implements the scanning operations and determines the 
fiat regions at level 1. For each pixel the number of the corresponding fiat region 
is provisionally kept in memory (in Pix2cf , see below). Step (B1 1) determines 
also which already updated confiners (obtained using Pix2cf ) - and even their 
ancestors at level / -hi using the data structure maintained in step (B3 m), m > / 
- are adjacent to such a region. This information is kept in memory (in Fusion, 
see below). We deduce from it in step (B2 /) (Figures 3(b), 5) which confiners 
at higher levels and which fiat regions at level I are fusing for obtaining the new 
confiners at level I (stored then in Pix2cf ). Step (B2 /) implements the fusion 

^^^¥li^^we have to know (in constant time if possible) for a given confiner its 
ancestor at level I -h 1 (Figure 3(a),(c)), in order to fuse the confiners and fiat re- 
gions, is one of the major difficulties (with respect to worst-case behavior) of the 
confinement tree construction. To do this, in step (B3 /), detailed in section 3.3, 
we define and maintain (by the propagation operations) a data structure associ- 
ated to the tree. It is called Node2ances and replaced in the optimized versions of 
(B3 /) (cf. section 3.3) by Leaves2ances or by SubtreeL2ances. Finally we pass 
in step (B4 /) once more through all pixels at level I to update (by the attribute 
operations) the attribute values of the now identified confiners (Figure 6). 

Step (B3 /) is crucial for worst-case analysis. In section 3.3 we present an 
algorithm (Figure 7) to execute step (B3 /) in Ts(/) time such that = 

0{nn log Un) = 0{n log n) time, assuming unique representation of identical con- 
finers. Steps (Bz /), i = 1,2,4, are executed in T^(/) = 0{n) time by the pro- 
cedures presented in Figures 4, 5 and 6. However, as we have in imaging practice 
often Un << n, step (B3 /) will not in general be the most consuming one (cf. 
section 5). 
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Fig. 3. (a) Part of an image to illustrate the detection of flat regions and confiners 
for a given grey level: dashed lines delimit flat regions at the current level 9 
and continuous lines delimit confiners at level 10; the numbers on the lines are 
that of the corresponding confiners or regions respectively; different arrow types 
correspond to different cases in Figure 4 (see section 3.2). (b) Boxes contain 
confiner numbers at level 10 (upper row ) and 9 (lower row) and flat region 
numbers (middle row). Broken lines and (as in (c)) white boxes correspond to not 
shown parts of the image. Arrows illustrate (as in (c)) the association made by 
the stated data structures; we omit in Region2conf implicitly given associations 
between the confiners at levels 9 and 10. (c) Confinement tree updated up to 
level 9; broken lines illustrate information added in step (B 9) 

Four Principal Data Structures The sub- steps (B1 /), ..., (B4 1) are related 
by four data structures essential for our algorithm. There are (cf. Figure 3) 
Pix2cf , Fusion, Region2conf , and, in a first approach, Node2ances which will 
be replaced by Leaves2ances or by SubtreeL2ances in section 3.3 when showing 
how to pass from 0{n^) complexity to 0{n log n) for (B3 /). Fusion is a dynamic 
array of stacks (see below); the other structures are dynamic arrays of integer 
values. Let us detail now, how this structures are defined and cooperate during 
the sub-steps (B1 /), ..., (B4 /). 

To each flat region at level /, determined in step (B1 /), we allocate a num- 
ber Ri'i which we store provisionally in each of its pixels (in an array Pix2cf ). 
Equally, we store for each (already discovered) confiner at level I + 1 

adjacent to such a region this number Ri^ in a stack Fusion(Cj(/+i)) and vice 
versa in a stack Fusion(i?^//). See in step (B3 1) and in Figure 3(a), (c) 

how we find the number of . The stacks in Fusion permit (as illustrated in 

Figure 3(b)) to connect in step (B2 1) all flat regions Ri'i and all confiners 
at the previous level which belong now to the new confiner Cu at the current 
level 1. Therefore, these connections enable us also to establish the father-son 
relationships between the confiners (nodes) at two successive levels. Fusion is 
implemented as an array containing after step (B1 1) the stacks Fusion(Cj(/+i)) 
for all confiners at level I + 1 and the stacks Fusion(i?^//) for all flat 

regions Ri'i at level 1. Using Fusion we merge in step (B2 1) all connected 
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regions in order to obtain the confiners at level L Here, we associate in the 
array Region2conf (see Figure 3(b)) each number identifying one of the confin- 
ers at level I -h 1 and each flat region number Ri'i with the number of the 

new confiner Cu containing the given confiner or region. Using Region2conf , we 
replace in the array Pix2cf the flat region number by that of the confiner Cu 
enclosing it. This is done by the procedure in Figure 5. 

Step (B3 1) operates on the tree only. We update an array Node2ances (in 
a first approach: more sophisticated approaches to implement step (B3 1) are 
proposed in section 3.3; they do not require changes in the other parts of the 
algorithm). Node2ances maps (Figure 3(c)), before step (B3 /), each already 
determined node to its ancestor at level I 1 and to its ancestor at level /, after 
(B3 /). This will be necessary for the steps (B1 / — 1) and (B2 / — 1): we stored 
(in the array Pix2cf) in the steps (B2 m), m > /, for each pixel at level m 
the number of the node/confiner at level m containing it but we will need its 
ancestor node at level 1. In step (B4 /), finally, we examine once more all pixels 
at level 1. We add to each new node first the attribute information coming from 
its sons in the tree by the procedure addNode(node,prevNode) and after the 
attribute information coming from the pixels by addPixel (node , pixel) . The 
corresponding procedure is given in Figure 6. 



3.2 Our Algorithm in Detail 

Let us give now some more detailed information for the procedures presented in 
the Figures 4, 5, 6 and show some techniques to speed up the algorithm. 



1. init (Fusion) ; /* initial. Fusion as array of empty stacks */ 

2. RegNumber = NodeNumb [L+1] ; 

3. for each regElement at level L do 

4. if (Pix2cf [regElement] == 0) /* new flat region */ 

5. RegNumber++; /* Number RiM of this new flat region */ 

6. Pix2cf [regElement] = RegNumber; 

7. push(pixS, regElement); /* init. of stack pixS to scan it*/ 

8. while (pop(pixS, pixel) != 0) 

9. for all nghb of pixel 

10. if ((F [nghb] == L)&& !Pix2cf [nghb] )/*arrow: cont . line*/ 

11. Pix2cf [nghb] = RegNumber; 

12. push(pixS, nghb); 

13. else if (F[nghb] > L) /* arrow: dots + dashes */ 

14. node = Pix2cf [nghb] ; /*node containing nghb and its*/ 

15. anc = Node2ances [F [nghb] ] [node] ; /* ances. at L+1 */ 

16. push(Fusion[anc] , RegNumber); 

17. push(Fusion [RegNumber] , anc); 

Fig. 4. Procedure to detect the flat regions and to establish the inter-region 
connexions (step B1 L) 
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Variables As a general convention we use capitals at the beginning of a variable 
name only if the variable appears also outside the procedure. These variables are 
F, Tree, and NodeNumb as described above, RegNumber (= Ri>i and = number 
of confiners at level I + 1 plus flat regions at /, after (B1 /)), L= /, and R= r, 
and the four data structures Pix2cf , Fusion, Node2ances, and Region2conf . 
These four variables, described already above and illustrated in Figure 3, are 
internal variables of (B) but Pix2cf and Node2ances are input and output for 
each step (B 1) whereas Fusion and Region2conf are cleared after step (B2 1). 
RegNumber reappears indirectly in line 12 of (B2 1) but is an internal variable of 
(B /). Outside the procedures, after (A), Pix2cf is initialized to zero for each 
pixel, and we set NodeNumb [R+1] =0. All variables beginning with a small letter 
are internal variables of the respective procedures. 



1. nodeNumber = 0; init (Region2conf ) ; /*init . all array el. to 0*/ 

2. for each prevNode at level L+1 do 

3. if (Region2conf [prevNode] == 0) /* new confiner at L*/ 

4. nodeNumber++ ; 

5. Region2conf [prevNode] = nodeNumber; 

6. push(regS, prevNode); /* regS: stack of regions to fuse */ 

7. while (popCregS, regl) != 0) 

8. while (pop (Fusion [regl] ,reg2) != 0) 

9. if (Region2conf [reg2] == 0) 

10. Region2conf [reg2] = nodeNumber; 

11. push(regS, reg2) ; 

12. for each flatReg at level L do 

13. if (Region2conf [flatReg] == 0) /* new leaf in the tree */ 

14. nodeNumber++ ; 

15. Region2conf [flatReg] = nodeNumber; 

16. NodeNumb [L] = nodeNumber; 

17. for each regElement at level L do 

18. Pix2cf [regElement] = Region2conf [Pix2cf [regElement] ] ; 

Fig. 5. Procedure to associate the flat regions and the confiners at level L+1 with 
the new confiners (nodes). The association is kept in the array Region2conf (step 
B2 /, /=L) 



Sub-steps (B1 1 ) and (B2 1 ) In the procedure in Figure 4, the flat regions 
are represented by the current value of RegNumber (= Ri'i). They are initialized 
in line 5-7 with regElement which corresponds in the example in Figure 3(a) to 
the encircled pixel in the middle at level 9. Pixels at the border are supposed 
to be set at 0 (see lowest row in Figure 3(a)). The procedure (Figure 4) detects 
each pixel in each flat region following a recursive scheme. It implements this 
scheme iteratively using the data stack pixS. We will remark here that we do not 
need a priority queue as in [7], where the pixels in the queue had to be ordered 
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according to their grey level. This is because all pixels which put in the queue 
have the same grey level L. The auxiliary function push(pixS, element) places 
element at the top of the stack pixS as push (Fusion [number 1] , number2) 
places the number number2 at the top of the stack Fusion [number 1] . pop (pixS , 
element) removes the element at the top of the stack and stores it in element. 
It returns 1 if there was an element in the stack, else 0. For a pixel fetched 
from pixS (line 8) we will examine all neighbors nghb (line 9): (i) if the neighbor 
is in the same flat region and not treated up to now (this corresponds to the 
continuous arrows in Figure 3(a)), it will be treated in lines 11, 12; (ii) if the 
neighbor is in a confiner at a previous level (arrow with dashes and dots), lines 
14-17 are executed; (iii) else, if the neighbor is at a lower level or already updated 
(dashes), nothing is done. In case (ii) the connexions between the lowest updated 
confiner containing the neighbor (represented by anc) and the current flat region 
(represented by RegNumber) are stored in Fusion (line 16, 17; Figure 3(b)). 

The array Region2conf in Figure 5 is of length RegNumber and its indices 
correspond to the confiners/ nodes at the previous level I -T 1 followed by the flat 
regions at level I as represented by Ri'i (Figure 3(b), line 2 and 5 in Figure 4). 
The Ri'i are stored in the Fusion stacks. The procedure in Figure 5 implements 
step (B2 1) following a very similar scheme up to line 11 as that implementing 
(B1 /). The differences are that only the number of the new confiner has to be 
kept (line 10 which corresponds to line 11 in Figure 4 and lines 9-11 against 
lines 10-17 in Figure 4) and that the number of connexions changes for different 
regions and confiners (line 8 against line 9 in Figure 4). Lines 12-15 detect the 
new leaves. Figure 6 implements step (B4 1) as already described above. The 
remaining step (B3 1) is detailed in section 3.3. 



Ways to Speed Up the Algorithm To improve in practice the calculation 
time, first, we will implement a way to store a connexion between a confiner 
and a flat region just once in the corresponding stack of Fusion during step 
(B1 /) (Figure 4). We use an array which marks for each confiner at the previous 
level if the current flat region has already been treated. We refer to this in 
section 5 as “mark_neighConf ”. We achieve in a similar way, to deal with a 
tree implementation using unique representation of identical confiners where we 
mark the confiners adjacent to a flat region at the current level. 

The second improvement for which we will present timings in section 5 is an 
attempt to implement all stacks in the Fusion array in one big array. It will be 
referred below as “arrayFusion_Proc”. 

3.3 Propagating Information through the Tree 

Let be given a rooted tree T known from its highest level down to a level I 
where the levels I decrease iteratively from the highest to the lowest one. This 
subsection addresses the problem of maintaining a data structure associated to T 
permitting, at each iteration, to find in constant time for any node at any level 
V > I its ancestor at level 1. 
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init (Tree [L] ) ; /* initial. Tree [L] [node] to 0 for all nodes at L */ 
for each prevNode at level L+1 do 
node = Region2conf [prevNode] ; 
addNode (Tree [L] [node] , Tree [L+1] [prevNode] ) ; 
for each regElement at level L do 

addPixeKTree [L] [Pix2cf [regElement]] , regElement) ; 

Fig. 6. Procedure updating the tree with the attributes of the confiners at level 
L (step B4 L) 



In a first approach, we implement this data structure using the array 
Node2ances (see Figures 3(c), 4) associating a node to its ancestor at the re- 
quired level 1. We update Node2ances for each new level / — 1 by replacing for 
each already updated node the number of its ancestor at level I by that at level 
l — l. This is illustrated in Figure 3(c). We call this procedure “all_nodes_Proc”. 




Fig. 7. (a), (b) Two ways to maintain a data structure permitting to find for each 
leaf in constant time its ancestor at level /; (c) example for worst-case analysis 
of all_leaves_Proc 



We can reduce the operations by associating to each node one of its descen- 
dant leaves. Then, we can reduce the above stated problem to leaves instead of 
nodes by implementing the required data structure as an array Leaves2ances 
and proceeding as in the procedure before, but updating the ancestor associa- 
tions only for each leaf. This is illustrated in Figure 7(a). We call this procedure 
“all_leaves_Proc”. However, as we will see in section 4 the worst-case com- 
plexity of all_leaves_Proc stays O(n^) as for the procedure all_Qodes_Proc. 

We show now, how to stay - as proven in section 4 - within a worst-case 
complexity of 0(n log n). We have to proceed as illustrated in Figure 7(b). For 
a given subtree, only for one leaf m we set an association to the subtree root. 
We can implement this association either directly as a new field of a leaf or as 
a separate list or array SubtreeL2ances. The other leaves in the subtree point 
to m and get the ancestor value from m. If a bifurcation appears in a node 6, we 
have to change the pointers of the leaves in the new subtree rooted in b to point 
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all to one single leaf of this subtree. The number of operations for this step is 
limited by the number of leaves in the subtree (see the proof of proposition 2 in 
section 4, where we call this procedure “subtreeL_Proc”). 

4 Complexity Analysis 

Lemma 1. Ifn is the number of pixels of an image and ifnn denotes the number 
of nodes in the eorresponding eonfinement tree, we have Un <n. 

Proof The lemma follows from the unique representation of identical con- 
finer s. 

Given a rooted tree with nodes (leaves included), how many nodes have 
to be passed through in order to go from all leaves to the root? One could think, 
the answer is 0(n^ logn^) as in two extremal cases - the complete binary tree 
and the tree with — 1 leaves - at most 0{nn log n^) passes are necessary, even 
only 0{nn) in the latter case. But Figure 7(c) illustrates a case^ where 0{n^) 
passes are required. This is also the worst-case as there are at most — 1 leaves 
and at most nodes on the path between a leaf and the root. This proves the 

Proposition 1. The proeedure all-leaves J^roc, given in seetion 3.3, updates 
the array Leaves2ances for all levels in a total time O(n^). 

Proposition 2. The proeedure suhtreeL-Proc, given in seetion 3.3, updates 
the array SubtreeL2ances for all levels in a total time O(nnlognn). 

Proof We will prove that we have at most log operations without 
counting the 0{nn) initialization operations. This, we will prove by induction 
on the number of nodes of the tree. The following logarithms are at base 2. 
For Un = 1 (initialization of the induction) there is nothing to show. We will 
consider the root of the tree and suppose that it has /c(A;>2;ifA; = lwe 
have {nn — 1) log (n^ — 1) + 1 < log n^) sons defining k subtrees for which the 
induction hypothesis is true. Therefore, and as we need less than operations 
when passing from one level to the next one (see section 3.3), we can reduce 
the problem to the following statement: ^zlog^z + ^ ^nlog^n,^n = 

1 -h Yli=i ^ we are in the convex part of the function x log x (as ni > 

1) this follows directly from Jensen’s inequality for convex functions: we have 

i J2i=i log rij < (i J2i ^i) log J2i ^i) from which follows Y.i=i log + 
nnlogk < nnlog{nn — 1) < nnlogn^. As 1 < logk {k > 2) this proves our 
statement and therefore our proposition 2. 

Theorem 1. The algorithm ealeulates the eonfinement tree in O(nlogn) time 
using 0{n) seanning, fusion, attribute and O(nnlognn) propagation operations. 



^ An image corresponding to this tree is possible, even for nn = nj2 
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Proof We will just prove that the total time to execute step (B2 1) for all 
levels I is 0{n). The rest follows either from lemma 1, proposition 2 or is clear. In 
step (B2 /), for a given level /, each flat region number at I and each number of a 
confiner at level / + 1 is put exactly once in the data stack regS. For each element 
in the data stack the number of operations is the number of its connexions with 
flat regions or confiner s. The number of all this connexions at a level I is limited 
by 4 times the number of pixels (forming the flat regions) at grey level I as each 
pixel has at most 4 neighbors. Therefore, the total number of connexions for all 
levels I is within 0{n) and our theorem is proven. 

5 Experimental Evaluation 

We measured the execution times for the algorithm and its parts as presented 
in the procedures of the Figures in section 3.2 for several images on a Intel 
Pentium III, 450 MHz. The timings are measured with the clock () -function 
under LINUX operation system and the results are summarized in figure 8 as 
well as information about the images (they have all 8 bit grey values; the MRI 
brain image is presented and treated in [12,11] and the microscopic images A, 
B in [11]). We observe stable behavior of our algorithm for approximately the 
same number of leaves and nodes but also increasing importance of step (B3 1) 
for larger images with larger trees: the propagation operations are about 10 
times more time consuming for the microscopic images C and D with respect 
to A and B, whereas the other operations increase only 3-4 times. However, 
as Un << n the propagation operations does not become dominant. At the 
contrary. Figure 2(b) illustrates that the execution time of the algorithm of 
Jones [8] can become prohibitive even if Un « n: for an increasing number of 
maxima (5 in 2(b)), all operations are multiplied by a factor depending linearly 
on this number. Figure 2(b) presents a synthetical image, however, the timings 
measured by Jones [8] on real images allow to conclude that our algorithm can 
be 10 up to 50 times faster in practice. 

6 Conclusion 

We presented a new algorithm for calculating the confinement tree and could 
prove that its worst-case complexity is O(nlogn). Moreover, the algorithm sep- 
arates different kinds of operations such that all operations except propagation 
stay in 0{n) time. We showed how to achieve complexity 0{nn log Un) for propa- 
gation operations where is the number of nodes in the tree (in a first approach, 
we implement propagation in O(n^), sufficient if << n). Two example images 
illustrate 0{n^ log n) complexity for a known algorithm, prohibitive computation 
time for this known algorithm even if << n, and 0(n^) for an algorithm cal- 
culating a tree related to the confinement tree. Practical evaluation allows to 
conclude that our improvement leads to much faster calculation times on stan- 
dard medical images. 
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Fig. 8. Test images and execution times in seconds for the implementation of 
our algorithm 
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Abstract. Thinning on a binary picture is an iterative layer by layer 
erosion to extract a reasonable approximation to its skeleton. This paper 
presents an efficient 3D parallel thinning algorithm which produces me- 
dial surfaces. Three-subiteration directional strategy is proposed: each 
iteration step is composed of three parallel subiterations according to 
the three deletion directions. The algorithm makes easy implementation 
possible, since deletable points are given by matching templates contain- 
ing twentyeight elements. The topological correctness of the algorithm 
for (26, 6) binary pictures is proved. 



1 Introduction 

Skeletonization provides shape features that are extracted from binary image 
data. A very illustrative definition of the skeleton is given using the prairie-fire 
analogy: the object boundary is set on fire and the skeleton is formed by the 
loci where the fire fronts meet and quench each others [3]. In discrete spaces, the 
thinning process is a frequently used method for producing an approximation to 
the skeleton in a topology-preserving way [6] . It based on digital simulation of 
the fire front propagation: border points of a binary object that satisfy certain 
topological and geometric constraints [21] are deleted in iteration steps. The 
entire process is repeated until only the “skeleton” is left. Therefore, a thinning 
algorithm can be regarded as a reduction operation that changes some I’s (object 
elements or black points) to O’s (white points) but does not alter O’s. 

A reduction operation does not preserve topology if 

— any object in the input picture is split (into two or more) or completely 
deleted, 

— any cavity in the input picture is merged with the background or another 
cavity, or 

— a cavity is created where there was none in the input picture. 

There is an additional concept called hole in 3D pictures. A hole (that dough- 
nuts have) is formed by O’s, but it is not a cavity [6]. Topology preservation 
implies that eliminating or creating holes is not allowed. 
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A simple point is an object point whose deletion does not alter the topology 
of the picture [13]. Sequential thinning algorithms delete simple points which 
are not end points, since preserving end-points provides important information 
relative to the shape of the objects. Curve thinning (i.e., a thinning process 
for extracting medial line) preserves line-end points while surface thinning (i.e., 
a thinning process for extracting medial surface) does not delete surface-end 
points. 




Fig. 1. A 3D synthetic picture containing a character “A” (left), result of a 
surface thinning process (centre), and result of a curve thinning process (right). 
(Cubes represent black points.) 



Parallel thinning algorithms delete a set of simple points. A possible approach 
to preserve topology is to use directional strategy; each iteration step is composed 
of a number of parallel subiterations where only border points of certain kind 
can be deleted in each subiteration. There are six kinds of border points in 3D 
pictures on cubic grid, therefore, 6-subiteration directional thinning algorithms 
were generally proposed. 

In this paper, a 3-sub iter at ion directional algorithm is proposed for surface 
thinning. Some experiments are made on synthetic objects and the topology 
preservation for (26,6) binary pictures [6] is proved. Our approach demonstrates a 
possible way for constructing non-conventional directional thinning algorithms. 

2 Basic Notions and Results 

Let p be a point in the 3D digital space 7Z^ . Let us denote A/’j(p) (for j = 
6, 18,26) the set of points j-adjacent to point p (see Fig. 2). The sequence of 
distinct points (xq, xi, . . . , is a j-path of length n > 0 from point xq to 
point Xn in a non-empty set of points X if each point of the sequence is 
in X and Xi is j-adjacent to x^-i for each 1 < i < n. (Note that a single 
point is a j-path of length 0.) Two points are j-connected in the set X if there 
is a j-path in X between them. A set of points X is j-connected in the set of 
points y D X if any two points in X are j-connected in Y. 
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The 3D binary (m,n) digital picture P is a quadruple V = m^n^B) [6]. 
Each element of 7Z^ is called a point of V. Each point in 5 C is called a 
black point and value 1 is assigned to it. Each point in is called a white 

point and value 0 is assigned to it. Adjacency m belongs to the black points 
and adjacency n belongs to the white points. A black component (or object) is a 
maximal m-connected set of points in B. A white component is a maximal n- 
connected set of points in 5 C . 

We are dealing with (26,6) pictures. It is assumed that any picture contains 
finitely many black points. 




Fig. 2. The frequently used adjacencies in . The set Nq{p) contains the cen- 
tral point p and the 6 points marked U, D, N, E, S, and W. The set N\^{p) 
contains the set Nq{p) and the 12 points marked The set N 2 q{p) contains 
the set N\^{p) and the 8 points marked 



A black point is called border point if it is 6-adjacent to at least one white 
point. (Note that this definition is correct only for the special cases m = 26 and 
m = 18.) A border point p is called \J -border point if the point marked by U 
in Eig. 2 is white. We can define N-, E-, S-, W-, and D-border points in the 
same way. 

A black point is called simple point if its deletion does not alter the topology 
of the picture. We make use of the following result for (26,6) pictures: 

Theorem 1. [12,19] Black point p is simple in picture (^^,26,6,5) if and 

only if all of the following conditions hold: 

1. the set {B\{p}) H N 26 {p) contains exactly one 26-component; and 

2. the set {Z\B) H Nq{p) is not empty and it is 6-connected in the set 

(^^s)niVi8(p). 

Theorem 1 shows that the simplicity in (26, 6) pictures is a local property; it 
can be decided using the 3x3x3 neighborhood of a given point. 

We need to consider what is meant by topology preservation when a number 
of black points are deleted simultaneously. Ma [8] and Kong [5] gave sufficient 
conditions for parallel reduction operations of 3D (26,6) pictures. We use the 
following more general sufficient conditions: 
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Theorem 2. [17,18] LetT he a parallel reduetion operation on (26,6) pietures. 
Then T is topology preserving, if for all pieture V = (^^,26,6,5), all of the 
following eonditions hold: 

1. for all points p G B that are deleted by T and for all sets Q C {Nis{p)\{p})n 
B that are deleted by T, p is simple in the pieture 26, 6, 5\Q); and 

2. no blaek eomponent eontained entirely in a unit lattiee eube (i.e., a 2 x 2 x 
2 eonfiguration in ) ean he deleted eompletely by T . 

We propose a surface thinning algorithm. The deletable points of the algo- 
rithm are border points of certain types and not surfaee end-points (i.e., which 
are not extremities of surfaces). The proposed algorithm uses the following char- 
acterization of the the surface end-points. 

Definition 1. [17,18] The set Nq{p) is subdivided into three kinds of opposite 

pairs of points (U,D), (N,S), and (E,W) (see Fig. 2). 

A blaek point p is a surfaee end-point in a pieture if the set Nq{p) eontains at 
least one opposite pair of white points. 

3 Existing Parallel Thinning Algorithms 

Most of the existing thinning algorithms are parallel, since the fire front prop- 
agation is by nature parallel. Those algorithms delete a set of simple points 
simultaneously that can alter the topology. There are three major strategies to 
overcome this problem: 

— Fully parallel algorithms: 

Algorithms from this group do not divide an iteration step into subiterations. 
In order to preserve topology, the known three fully parallel 3D thinning al- 
gorithms investigate larger neighborhood than the 3x3x3 one: Ma proposed 
an algorithm, in which the new value of a black point depends on 30 points 
(and a parallel rechecking pass is required) [9], the fully parallel algorithm 
of Ma and Sonka uses a special neighborhood containing 50 points [10], and 
Manzanera et al. developed an algorithm using a symmetric neighborhood 
consisting of 81 points [11]. 

— Directional (or border sequential) algorithms: 

Iteration steps are divided into a number of successive subiterations, where 
only border points of certain kind can be deleted in parallel in each subit- 
eration. Consequently, each subiteration uses different deletion rule. Since 
there are six kinds of major directions in 3D pictures, 6-subiteration di- 
rectional thinning algorithms were generally proposed [2,4,7,14,15,22]. Note 
that Palagyi and Kuba developed 8-subiteration [17] and 12-subiteration [18] 
directional thinning algorithms, too. Each existing directional algorithm ex- 
amines the 3x3x3 neighborhood of each border point. 



410 



Kalman Palagyi 



— Subfield sequential algorithms: 

The 3D digital space 7Z^ is subdivided into more disjoint subfields that are 
alternatively activated. At a given iteration step, only border points in the 
active subfield are designated to be deleted. Each subiteration is executed in 
parallel (i.e., all border points in the actual subfield satisfying the deletion 
condition are simultaneously deleted). Two subfield sequential 3D thinning 
algorithms working in cubic grid has been proposed so far [1,20]. Both al- 
gorithms investigate the 3x3x3 neighborhood and use eight subfields, 
therefore, each iteration step contains eight successive subiterations. Note 
that Palagyi and Kuba proposed a hybrid thinning algorithm [16]. It uses 
both subfield sequential and directional approaches (with two subfields and 
eight deletion directions). 

The algorithm proposed in this paper follows the directional strategy. It 
requires only three subiterations (corresponding to the three kinds of opposite 
pair of points) in each iteration step, but an additional point not in the 3 x 3 x 

3 neighborhood is examined in each subiteration. 

4 The New Thinning Algorithm 

In this section, a new algorithm is presented for extracting medial surfaces from 
3D (26, 6) pictures. 

Each conventional 6-subiteration directional thinning algorithm uses the six 
deletion directions that can delete certain U-, D-, N-, E-, S-, and W-border 
points, respectively [2,4,7,14,15,22]. In our 3-subiteration approach, two kinds of 
border points can be deleted in each subiteration. The three deletion directions 
correspond to the three kinds of opposite pairs of points, and are denoted by 
UD, NS, and EW. 

Suppose that the 3D (26,6) picture to be thinned contains finitely many 
back points. Reduction operations associated with the three subiterations are 
called deletionjfrom.UD, deletionjfromJJS, and deletiomfrom_EW. We are 
now ready to present the 3-subiteration approach formally: 

Input: picture V = {7Z^ ^ 26, 6, B) 

Output: picture V = (^^,26,6,5') 

3-subiteration_thinning(5 ,50 
begin 
B' = B', 
repeat 

B' = deletionjfrom_UD(5'); 

B' = deletionjfromJ\fS(5'); 

B' = deletionjfrom_EW(5'); 
until no points are deleted; 
end . 
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The new value of a black point depends on 28 points in each sub iteration. The 
three special neighborhoods assigned to the different subiterations are presented 
in Fig. 3. 
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Fig. 3. The special local neighborhoods assigned to the deletion directions UD 
(a), NS (b), and EW (c), respectively. The new value of a black point p depends 
on the 3x3x3 neighborhood of p (marked “0” ) and an additional point (marked 
“♦”) that is not in N 2 e{p) 



Deletable points in a subiteration are given by a set of matching templates. 
A black point is deletable if at least one template in the set of templates matches 
it. Templates are usually described by three kinds of elements, “black”, “white”, 
and “don’t care” , where “don’t care” matches either black or white point in a 
given picture. In order to reduce the number of masks we use additional notations 
(see Fig. 4). 

The first subiteration assigned to the deletion direction UD can delete cer- 
tain U- or D-border points; the second subiteration associated with the dele- 
tion direction NS attempt to delete N- or S-border points, and some E- or 
W-border points can be deleted by the third subiteration corresponding to the 
deletion direction EW. The set of templates Tud is given by Fig. 4. Note that 
Fig. 4 shows only the eight base templates T1-T8. Additionally, all their rota- 
tions around the vertical axis belong to where the rotation angles are 90°, 
180°, and 270°. This set of templates was constructed for deleting some simple 
points which are neither surface end-points (see Definition 1) nor extremities 
of surfaces. The deletable points of the other two subiterations (corresponding 
to deletion directions NS and EW) can be obtained by proper rotations of the 
templates in Tuo- Each template of our algorithm can be given by a Boolean 
condition that makes easy implementation possible. 

Note that choosing another order of the deletion directions yields another 
algorithm. The proposed algorithm terminates when there are no more black 
points to be deleted. Since all considered input pictures are finite, it will termi- 
nate. 
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Fig. 4. Base templates T1-T8 and their rotations around the vertical axis form 
the set of templates Tud assigned to the deletion direction UD. This set of 
templates belongs to the first subiteration. Notations: each position marked “c”, 
and matches a black point; each position marked “O” matches 
a white point; each (“don’t care”) matches either a black or a white point; 
at least one position marked “x” matches a black point; at least one position 
marked “y” matches a black point. Emphasis is to be put that “x” and “y” 
positions provide that surface end-points cannot be deleted. (Note that using 
different symbols for black template positions helps us to prove the topological 
correctness of the algorithm) 



5 Discussion 

The proposed algorithm has been tested on objects of different shapes. Here we 
present only four examples (see Figs. 5-8). 

The proposed 3-subiteration thinning algorithm is topology preserving for 
(26, 6) pictures. It is sufficient to prove that reduction operation given by the set 
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of templates is topology preserving. If the first subiteration of the algorithm 
is topology preserving, then the other two are topology preserving, too, since the 
applied rotations of the deletion templates do not alter the topological properties. 
Therefore, the entire algorithm is topology preserving, since it is composed of 
topology preserving reductions. 





Fig. 5. A synthetic object containing a cube with a hole (left) and its medial 
surface produced by the proposed algorithm (right) 





Fig. 6. A synthetic object containing a cube with two holes (left) and its medial 
surface produced by the proposed algorithm (right) 



In order to prove both conditions of Theorem 2, we classify the elements of 
templates and state some properties of the set of templates The element 
in the very centre of a template is called central (marked by “c” in Fig. 4). A 
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noncentral template element is called black if it is always black (marked by 

and in Fig. 4). A noncentral template element is called white if it is 
always white (marked by “O” in Fig. 4). Any other noncentral template element 
which is neither white nor black, is called potentially black (marked by “x” , “y” , 
and in Fig. 4). A black or a potentially black noncentral template element is 
called nonwhite. A black point p is deletable if it can be deleted by at least one 
template in 7uo; P is nondeletable otherwise. 




Fig. 7. A synthetic object containing a doughnut (left) and its medial surface 
produced by the proposed algorithm (right) 



Observation 1. Let us examine the configurations illustrated in Fig. 9. 

1. Black point p in configuration (a) is nondeletable. 

2. Black point p in configuration (b ) is deletable if 

— q = 0, r = 1, and s = 1, or 

— q = 1 and r = 0. 

3. Black point p in configuration (c) is deletable if q = 1 and r = 1. 

4- Black point p in configuration (d) is deletable if q = 1. 

The topological correctness of the first subiteration of the proposed algorithm 
is stated by the following theorem: 

Theorem 3. Reduction operation given by the set of templates T^d 'Is topology 
preserving for (26, 6) pictures. 

Proof, (sketch) It is easy to see that each template in Tu^ deletes only simple 
points of (26,6) pictures. 

The first point is to verify that there exists a 26-path between any two non- 
white positions (condition 1 of Theorem 1). It is sufficient to show that any 
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Fig. 8. A synthetic picture containing four characters (top) and their medial 
surface produced by the proposed algorithm (bottom) 



potentially black position is 26-adjacent to a black position and any black posi- 
tion is 26-adjacent to another black position. It is obvious by careful examination 
of the templates in Tuo- 

To prove that condition 2 of Theorem 1 holds, it is sufficient to show for each 
template in %jr, that: 

1. there exists a white position 6-adjacent to the central position, 

2. for any two white positions 6-adjacent to the central position p are 6- 
connected in the set of white positions 18-adjacent to p, 

3. and for any potentially black position 6-adjacent to the central position p, 
there exists a 6-adjacent white 18-neighbor which is 6-adjacent to a white 
position 6-adjacent to p. 

The three points are obvious by a careful examination of the templates in 

We know that each deletable point p is simple. It can be stated that the 
value of any point coinciding with a potentially black template position does not 
alter the simplicity of p. We can state that the simplicity of a point p does not 
depend on the points that coincide with a template position marked , “x” , or 
“y” (see Fig. 4). In addition, black points that coincide with template positions 



416 



Kalman Palagyi 






Fig. 9. Configurations assigned to Observation 1. Note that Observation 1 holds 
for their rotations around the vertical axis, too. (The rotation angles are 90°, 
180°, and 270°.) 



marked are nondeletable (by Observation 1/1). Therefore, it is sufficient to 
deal with deletable points that coincide with template positions marked 
Note that base templates Tl, T4, T5, and T8 (and their rotated versions) do 
not contain any positions marked Therefore, only base templates T2, T3, 
T6, and T7 (and their rotated versions) are to be investigated. It is easy to see 
with the help of Observation 1 that deletion of points coinciding with template 
positions marked do not alter the simplicity of point p. Therefore, Condition 
1 of Theorem 2 is satisfied. 

Condition 2 of Theorem 2 can be seen with the help of Observation 1/2, too. 
Let us consider a unit lattice cube containing an upper set of four points U = 
{^ 1 , ^ 3 , U 4 } and a lower set of four points L = {/i, / 2 , ^ 3 , ^ 4 }- Let C C UUL be 

a black component contained in the unit lattice cube. If CnL contains a deletable 
point then C H U ^ 9 by Observation 1/2. It is easy to see, that any point in 
CnU is nondeletable by Observation 1/2. Therefore, black component C cannot 
be deleted completely. □ 
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Abstract. Medial Axis, also known as Centres of Maximal Disks, is a 
representation of a shape, which is useful for image description and anal- 
ysis. Chamfer or Weighted Distances, are discrete distances which allow 
to approximate the Euclidean Distance with integers. Computing medial 
axis with chamfer distances has been discussed in the literature for some 
simple cases, mainly in 2D. In this paper we give a method to compute 
the medial axis for any chamfer distance in 2D and 3D, by local tests 
using a lookup table. Our algorithm computes very efficiently the lookup 
tables and, very important, the neighbourhood to be tested. 

Keywords: Medial axis. Centres of Maximal Disks, Chamfer Distances, 
Lookup Table Transform, Shape Representation. 



1 Introduction 

A digital image is a set of colored points in a space, for instance the voxels 
on the cubic grid. A shape in a binary image, is a subset of the points of the 
image, sharing the same color. The points of the shape, taken individually, do 
not provide any global information for the shape description. 

Given a family of disks and a shape, we consider the disks completely included 
in the shape. If the smallest disk in the family is a single point, then the shape 
can be completely covered by such disks. Coding their positions and sizes is 
sufficient to reconstruct the original shape. 

Now consider a covering of the shape by maximal disks, defined as follows: 

Definition 1. A disk is maximal in the shape if the disk is not included in any 
single other disk in the shape. 

The maximal disks are centred in the shape. Note that a maximal disk can 
be included in the union of other maximal disks; so the covering by maximal 
disk, which is unique by construction, is not always a minimal covering. 

Definition 2. The Medial Axis (MA) of a shape is the set of centres of maximal 
disks in the shape. 
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The medial axis is a reversible coding of the shape; it is a global representa- 
tion, centred in the shape, which allows shape description, analysis, simplification 
or compression. The medial axis is often not thin and disconnected. It is a step 
for weighted skeleton computation [12] and implicit surface reconstruction [7]. 





The aspect of the medial axis and its computation depend on the geometry 
of the family of disks, which can be circles, rectangles, distance balls, etc. For 
instance, figure 1 schematizes a medial axis covering a shape with circles. 

The medial axis is especially attractive when the disks are distance balls, 
for computation cost and for description purpose. A distance is positive defined, 
symetric and respects the triangular inequality. Given a distance d, the distance 
ball Bd of centre p and radius r, is the set of points 

Bd{p,r) = {q : d{p,q) <r} (1) 

which is centre-symetric. There exists innumerable distances functions; since 
digital images are the most often encoded in integer arrays, we focus on discrete 
distances, which are distances working with integers. 

2 Chamfer Distances 

Chamfer distance dc can be defined in the following way: a chamfer mask A4c is 
a set of legal displacements in a neigbourhood, each displacement being weighted 
by an integer cost; the chamfer distance dc between two points is the cost of the 
path of least cost joining them, formed with legal displacements in the mask. 

Borgefors popularizes chamfer distances in [2] , in any dimension. Afterwards 
many optimization methods have been proposed, to approximate the Euclidean 
distance dE] the major contribution is due to Verwer in 2 and 3 dimensions [13]. 
One can find in [12] a complete history of chamfer distance, the comparison 
of different optimization methods and crossing formulas between them. More 
recently, we propose in [8] new results in the 3D case, using Farcy triangulations. 

Chamfer distances have many advantages, which justify their success in ap- 
plications. They are local distances, that is to say, which permit to deduce a 
distance from the distances of close neighbours, unlike dE- All computations are 
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done using integers and linear operations As we will see, the compu- 

tation of the medial axis can also be done by local tests. 

The major attraction is the high speed — and simplicity — of the distance 
transform algorithm, denoted DT, due to Rosenfeld et ah [10]. The DT consists 
in labeling each point of a shape to its distance to the complementary. The DT is 
global, and operates in 2 passes on the image, independently of the thickness of 
the shape in the image, and of the dimension. The Reverse Distance Transform 
(RDT) allows to recover a shape from it’s medial axis, also in 2 passes. 
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Fig. 3. First visible points 




Our work space is the cubic grid, associated with the fundamental lattice A 
of Z^. The cubic grid implies the symmetry towards planes of axes and bissec- 
trices, called 48-symmetry : it divides Z^ into 48 sub-spaces (48 = 2^.3! with 2^ 
sign combinations and 3! coordinates permutations), versus 8 octants in Z^. We 
denote S the 48^^ of space represented in figure 2: 

S =( 0 < z < y < X , (x, y, z) e Z^ j . (2) 

We call weighting a displacement ~if{x^y^z) G Z^ associated with a 

weight re G N*, also denoted In our work space, a chamfer mask Me is 

a 48-symmetric set of m weightings 

Me = { Mi{xi, yi, Zi,Wi) , l<i <m] . (3) 

The generator of a mask A4c is the part A4c H <S, from which are deduced 
all other weightings by the 48-symmetry. The cardinal of is denoted rrig. 
Given a displacement 1/ in we name the corresponding displacement 
in M^fj by the 48-symmetry. 

A weighting (x, z, w) generates by translation the periods (2x, 2^, 2z, 2ie), 
(3x, 3^, 3z, die), etc. For the sake of efficiency during DT, it is self-evident that 
M^(j should only be formed of points such that ged{x, y,z) = 1. The points 
having this property are said visible (from the origin, see [5]). The set of visible 
points of S can be obtained with a sieve upon the periods of visible points, by 
scanning S on x,y,z. Visible points are named a, b, c, . . . in the sieve order. We 
give figure 3 the cartesian coordinates of the first visible points in S. Properties 
of the choice of visible point subsets in a chamfer mask are studied in [8,9]. 
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3 Existing Methods to Extract MA 

3.1 Local Maxima 

After the DT, each shape point p is labeled to its distance DT[p] to the comple- 
mentary. Let 1 / be a displacement of the mask Ate- The point p+l/ is deeper 
inside the shape than p (see figure 4) if DT\p +V] > DT\p]. Because of the 
definition of the chamfer distances, the greatest possible value of DT[p^^] is 
DT[p] + W[V]. If this hapens, then the point p propagates to p the dis- 
tance information during the DT. We deduce that the disk centered in p + 
completely overlaps the disk centered in p (figure 4), thus p ^ MA. 

On the contrary, if p does not propagate any weighting, then p is called a 
Local Maximum. Such a point verifies 

DT[p + V] < DT[p] W[V] , Vlf e Me (4) 

which we name Local Maximum Criterion (LMC). The set of points detected by 
the LMC, includes the MA by construction. Rosenfeld and Pfaltz showed in [10] 
that for the basic distances such that a = 1 ((I 4 and dg in 2D, de, dig and 6/26 in 
3D), the LMC set is exactly MA. 



3.2 Equivalent Disks 

This is no more true from the moment that a > 1 , since the LMC detects the 
MA plus erroneous points, which are not center of maximal disks. The LMC set 
is still reversible ; but the erroneous points are generally numerous, in particular 
close to the border of the shape, and they make the MA completely unusable 
for applications. 

The trouble comes from the check in (4) in the difference between disk radii 
on DT. In fact, if a > 1, then several radii may correspond to a same disk, ie. a 
given disk may have an interval of equivalent integer radii. The computation of 
the equivalence classes is related to the Frobenius problem [11,6]. For instance, 
the class of equivalence of the single pixel ball is [1 . . a]. The corollary is that (4) 
is inadequate for a > 1 . 

Arcelli and Sanniti di Baja showed in the 2 D case for 3 x 3 masks that 
it is sufficient to bring down each value on the DT to the lowest term in its 
equivalence class ; the LMC is then exact on the modified DT. For instance, 6 / 3,4 
simply needs to bring the 3 down to 1 and the 6 down to 5 [1]. Their method is 
inappropriate for masks greater than 3 x 3 in 2D and 3 x 3 x 3 in 3D, because 
of the appearance of influence cones in chamfer balls [ 12 ]. 



3.3 Lookup Tables 

The most general and efficient solution is the method of the lookup tables, which 
stores the corrections to the LMC. 

A shape point p is a maximal centre if there is no other shape point q such 
that the ball Bd{qj DT[q]) entirely overlaps the ball Bd{p, DT\p\). The presence 
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of q forbids p to be a MA point. Suppose that (i) it is sufficient to search q in a, 
local neigbourhood of p and (ii) that we know for each DT\p] the minimal value 
DT[q], stored in a lookup table Lut, which forbids p in direction 1 / = p^. 

(i) The local neighbourhood of vectors to be tested is denoted Mlui and is 

48-symmetric. The generator of Mhut is denoted Given V G M.Lut^ 

we name the corresponding vector by the 48-symmetry in 

(ii) The minimal value for p and is stored in Lut[~^][DT[p\\. Because of 
the 48-symmetry, it is sufficient to store only the values in M^ut ’ ti^ace the 
minimal value for p and if is accessed using Lut[~lf^][DT\p]]. 

Finally we have the following criterion: 

p G AM DT\p-\-lf] < Lut[~tf^][DT\p\] , W G Mlui • (5) 

The first use of lookup tables is due to Borgefors, Ragnemalm and Sanniti 
di Baja in [4], for cLe in 2D. The lookup tables are computed with exhaustive 
search; the combinatory is enormous, but the computations are done once for 
all. The tables are given for radii less than a/ 80- Borgefors gives in [3] the lookup 
tables for the 2 D distance 0 ^ 5 , 7 , 11 , which entries differs from the EMC for radii 
less than 60; but she does not generalize her lookup table computation method. 

One of the authors proposes in [ 12 ] an efficient algorithm to compute the 
lookup tables for any chamfer mask in 2 D, assuming that Mhut = Me- But he 
points out that for large masks, erroneous points may be detected: a contradic- 
tion, and a fondamental question about the validity of the whole method. 

We have recently discovered that the assumption Meui = Me actualy is 
the error. In fact, the two masks often completely differ, and we propose in the 
following the correct and efficient algorithm which computes both Mlui and 
Lut in 3D. The following method is immediately applicable to the 2D case by 
skipping any reference to z. 

4 Proposed Method to Compute Lookup Tables 

Let us come back to the meaning of the labels on the DT image. A point p is 
labeled to its distance r = DT[p] to the complementary. If we apply the RDT 
on p, it generates the reverse ball denoted r), which is the set of points 

= {9 : > 0} (6) 

and which is the greatest ball inside the shape, centred in p. From (I) and ( 6 ) 
it comes immediately the following lemma: 

Lemma 1. Bd{p,r) = B^^{p,r 1) . 

While the sets are the same, it is important to note that resulting labels have 
different values on DT and RDT, as demonstrated figure 5. 

The computation of an entry Lut[~Jf][r] in the lookup table for r = DT[p] in 
a direction if, consists in finding the smallest radius R of a ball 5^^(p-hlf , R) 
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Fig. 7. Overlapping test on two 
balls restricted to S 




Fig. 8. Translated overlapping 
test on DT^ 



which overlaps completely (see figure 4). As illustrated figure 7, a 

method to find R is to decrease the radius while the ball overlaps, but each 
step needs a RDT, and the cost is prohibitive. The lemma 1 is the starting point 
of our method: we will show that it is sufficient to compute the DT in S only a 
single time at the beginning; the lemma 1 allows to test the overlappings on S. 



4.1 Computing an Entry of the Lookup Table 

We choose to restrict our study to balls which are convex and sy metric, ie. to 
distances inducing a norm (see [ 8 ]). Therefore, we can limit the overlapping test 
by restricting the two balls to <S, which gives the figure 7. This loss of generality 
should not be annoying since non-norm distances are not homogeneous (the 
shortest path may zigzag over the grid). Note however that it is still possible not 
to restrict to S leading to a greater computationnal load. 

Instead of reducing the radius while the ball overlaps in figure 7, we 
propose to translate both B~^^{p^r) and B~^^{q^R) to the origin as shown in 
figure 8 . We scan each point p\ of B~^^{0^ r) H <S, which by translation of vector 
gives p 2 - Finally, we find 

R = max { d{0,p2) : P 2 = Pi + , Pi ^ B~^^{0,r) n } (7) 

= max { d{0,pi + V^) : pi G B^^{0,r) n 5 } . ( 8 ) 
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This process can be very efficiently implemented, using a correspondence 
between figure 8 and the image CT^, which is the result of the Cone Transform. 
It gives for any point of S its distance from the origin, see figure 6. a. For any 
chamfer distance that induces a norm (see [9,8]), a fast algorithm (given in 
figure 9) exists that computes CT^ in a single pass, only using 



1 


PROCEDURE ComputeCT® ( L, Cone ) ; 

Cone[(0, 0, 0)] = 0 ; 




2 


FOR a; = 1 TO L - 1 , FOR y = 0 TO a: , FOR 2 = 0 TO 


y DO 


3 


{ 




4 


min = +00 ; 




5 


FOREACH If® IN DO 




6 


{ 




7 


{x',y',z') = (x,y,z) - If® ; 




8 


IF {x',y',z) G S AND Cone[{x' ,y' , z')] + W {U 


'®) < min THEN 


9 


min = Cone[{x ,y' , z')] + ; 




10 


} 




11 


Cone[{x,y, z)] = min ; 




12 


} 





Fig. 9. Fast Cone Distance Transform Algorithm. Input: L the side length, 
the generator of the dc mask. Output : Cone the distance image containing 
the cone 



PROCEDURE ComputeLutColumn ( Cone, L, Rmax, Lut[lf^] ) ; 

1 FOR r = 1 TO Umax DO Lut[lf^][r] = 0 ; // Initializes Lut[lf^] to 0 

2 FOR X = 1 TO L - 1 , FOR 2/ = 0 TO X , FOR z = 0 TO y DO 

3 { 

4 ri = Cone[{x, y, z)] + 1 ; // radius of the ball where pi is located 

5 V 2 — Cone[{x, y, z) + + 1 ; // same for p 2 

6 Lut[lf^][ri] = MAX {Lut[lf^][ri],r 2 } ; 

7 } 

8 T2 = 0 ; 

9 FOR n = 0 TO Umax DO 

10 IF Lut[l^^][ri] > T 2 then T 2 — Lut[~Jf^][r\\ ELSE Lut[~^^][r\\ — V 2 ; 

Fig. 10. Lut Computation Algorithm. Input : Cone the cone, L the side length, 
the column of Lut to be computed, Rmax the greatest radius value seekable 
in Lut. Output : Lut[lf^] is filled with the correct values 



The implementation of Lut computation shown in figure 10, consists in a 
scan of S (line 2-7). For each point pi, we look for the corresponding radius r\ 
which is CT^\pi] + 1 because of lemma 1. We then look for the radius T 2 of 
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the ball passing by point p 2 ] its value is CT^\p 2 ] + 1 = CT^\pi + + 1, also 

because of lemma 1. During the scan, we keep the greatest value found for r 2 , 
which at the end, is R by (8). 

Due to the discrete nature of the balls, one can observe in Lut, cases where 
for instance ri < V 2 while Lut[~tf^][ri] > Lut[~Jf^][r 2 ]^ which means that the ball 
overlapping Bd{ri) is bigger than the ball overlapping Bd{r 2 ). These artefacts of 
discrete distances should not happen because it has been proved in [13], p. 20, 
that any dc is a distance function, and thus 

Vri,r2 ri <T2^ Bd(ri) C Bd(r2) . (9) 

We therefore have to correct the table by assuming that in this case, Lut[~^^] [r 2 ] 
should at least equals Lut[~^^][ri] (figure 10, lines 8-10). 



4.2 Computing the Mask of the Lookup Table 

We now focus on the computation of the set of weightings M-luv 

We assume that a given set is sufficient to extract correctly the MA 

from any DT image which values does not exceed Rkhowu^ ie. this enables 

to extract from any ball Bd{0^ r) where r < Rkhowu^ a medial axis which is (by 
definition 2) the sole point O (the origin). At the beginning, is empty 

and R Known — 0* 

We propose to test each ball Bd{0, r), where r > RKnown^ each time extract- 
ing its DT and then its MA, until whether r reaches Rrargeu or a point different 
from O is detected in the MA of Bd{r). If r reaches R Targets then we know that 
•^Lut enables to extract MA correctly, for any DT containing values lower or 
equal to RTarget- 

On the contrary, if we reach the case where (at least) one extra point p is 
found in MA, it means that M-^ut sufficient to correctly extract MA. We 

propose to introduce the new weighting (p, CT^[p]) in If this new mask 

is sufficient to remove p then the process continues. If not, there is no possible 
Lut for this distance and the process stops, reporting the problem (this should 
never happen if d actualy induces a norm). 



4.3 Related Algorithms 

The full algorithm given figure 11, uses two other algorithms given figures 12 
and 13. They are dedicated versions (limited to S and thus 48 times faster) of 
the classical Distance Transform and Medial Axis Extraction. 

Note that the use of DT^ (figure 11, line 7) is mandatory, since the MA is 
extracted from the DT to the complementary (see §2 and §3). In fact, a simple 
threshold on image CT^ to the radius R gives only the Bd{0,R) H S set, but 
not the correct labels (see figure 6, where values of (a) differ from (b)). 
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PROCEDURE ComputeAndVerifyLut ( L, RKnown, Rrarget ) ; 

1 Compute(7T^(L, Cone) ; 

2 Rmax = MAX {Cone\p\ : p G Cone} ; 

3 FOREACH IN DO ComputeLUTColumn ( Rmax, Lut ) ; 

4 FOR R — RKnown + 1 TO Rrarget DO 

5 { 

6 Vp, DT ^ H = { J ^ ; // Copy Sd(i?) n 5 to DT® 

7 ComputeOT^(A4^, DT^) ; 

8 FOR X = 1 TO L , FOR 2/ = 0 TO X , FOR z = 0 TO y DO 

9 IF IsMA ( (x,y,z), Lut, DT^ ) THEN 



10 


{ 




11 


M = {x,y,z,Cone[{x,y,z)]) ; 


// Build a new weighting M 


12 


= XL, U {M} ; 


// Add M to 


13 


ComputeLUTColumn ( (x,y,z), Rmax, 


Lut[(x , y , z)] ) ; // and Lut 


14 


IF IsMA ( ix,y,z), Lut, DT^ ) 


\ THEN ERROR ; 


15 


} 




16 


} 




17 


RKnown — RTarget 





Fig. 11. Full Lut Computation Algorithm with determination of In- 
put: L the side length of the images, the generator of the Lut neigh- 

bourhood, RKnown the last verified radius, Rrarget the maximum radius to be 
verified. Output: Lut the lookup table, Rxnown the new verified radius 



5 Results 

While the computation of the Lut array in figure 10 is very fast (less than a 
second^), the computation of M^^ut figure 11, involving its verification, is 
rather slow, as shown below, and its result should hence be saved for further 
usage. It takes approximatively six minutes (370s^) to compute the for 

distance <iii,i 6 ,i 9 ,j =45 for L = 100 and from R Known = 0 to Rrarget = 1100 (the 
radius of the biggest Bd C S possible in the L^ sized image CT^). This load 
is explained by the systematic test of 1100 balls Bd{r). Each of them involves 
computations (CT^ and MA extraction) on o(r^) points. It is therefore much 
more interesting to use chamfer distances with small values of W[a] since this 
gives fewer balls to test and thus a faster result. 

In this scope, in most cases, it is interesting to compensate for the quality 
loss by more weightings in A4^. For example, the extraction of M-^ut 
tance 0 ^ 7 , 10 , 12 , d= 16 ,e =17 for L = 100 from RKnown = 0 to Rrarget = 700 is 33% 
faster, while achieving a better approximation of dr- 

We give in figure 14 some examples of Lut arrays where 
The tables show only the values which differ from the EMC (ie. R -h W[~df^] ^ 
Lut[~df^][R], see (4) and (5) ), and thus represent the irregularities. 



^ On a SGI Octane/IRIX 6.5, Mips RIOOOO 180 MHz 
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PROCEDURE ComputeDT® ( L»T®, L, ) ; 

1 FOR z = L - 1 TO 0 , FOR y = L - 1 TO z , FOR x = L - 1 TO y DO 

2 IF DT® [{x, y, z)] ^ 0 THEN 

3 { 

4 min = +oo ; 

5 FOREACH if® IN Af^ DO 

6 { 

7 IF (x,y,z) + If® e 5 THEN m; = DT®[(x,j/,z) + U"®] + fh(lf®) 

8 ELSE w = +00 ; 

9 min = MIN {w,min} ; 

10 } 

11 DT^[{x,y, z)] = min ; 

12 } 



Fig. 12. Fast Distance Transform in S. Input; DT® the shape (limited to S), L 
the side length, Mq the generator of the dc mask. Output; DT® the distance 
map 



FUNCTION IsMA ( p, Mi^^, Lut, DT^ ) ; 

1 FOREACH If® IN DO 

2 IF DT®[p+ If®] < Lui[lf®][DT®[p]] THEN RETURN FALSE ; 

3 RETURN TRUE ; 



Fig. 13. Fast extraction of MA points from Bd H S. Input: p the point to test, 
MLt the generator of the Lut neighbourhood, Lut the lookup table, DT^ the 
distance transform of the section of the ball. Output: returns TRUE if point p is 
detected as MA in DT^ 

The figure 15 shows a full example of both the computed mask and 

the Lut array for distance dnp6,i9,j=45- One must note the difference between 
M% and and the presence of point 3 e which is not a visible point, and 

thus would not have appeared in as seen in section 2 . We finally give in 
figure 16 the distance dig^27,33,d=43,e=47 5 which is as in figure 15 , very 

different from its corresponding 

6 Conclusion 

In this paper, we give the solution to the fundamental problem pointed out 
in [ 12 ], where the computed lookup tables detected erroneous MA points for 
large masks. We introduce a new mask M.Lut^ different from the mask Me used 
to compute DT. We present and justify our algorithm to compute M Lut and Lut. 
The correctness of the method is verified during the computation of M Lut and 
thus ensures that the medial axis is free of the error points encountered in [ 12 ]. 
We use the symetry of the ball of chamfer norms to minimize computations. 
Finally, we give results for various chamfer norms given in [ 8 ]. 
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Fig. 14. Examples of Lut of 4 dc for which = ^\ut 
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Abstract. Multiresolution modelling of polygonal surface meshes has 
been presented as a solution for the interactive visualisation of scenes 
formed by hundreds of thousands of polygons. On the other hand, it has 
been shown that representing surfaces using sets of triangle strips or fans 
greatly reduces visualisation time and provides an important memory 
savings. In this paper we present a new method to model polygonal 
surface meshes. Like the previously explained Multiresolution Ordered 
Meshes (MOM), this method permits the efficient management of an 
ample range of approximations of the given model. Furthermore, this 
method utilises the triangle fan as its basic representation primitive. 
Experiments realised with data sets of varying complexity demonstrate 
reduced storage space requirements, while retaining the advantages of 
MOMs. 



1 Introduction 

One of the principle objectives of multiresolution modelling [1] is to permit in- 
teractive visualisation of surfaces formed by thousands of polygons. Given a 
mesh, M, a multiresolution model defines how to store and retrieve n differ- 
ent approximations or levels of detail (LOD), Mo,Mi, in an efficient 

manner. 

Several mechanisms have been developed to accelerate the process of visual- 
ising polygonal models. For example, strips and fans of triangles (see figure 1) 
appear as drawing primitives in some graphics libraries, such as OpenGL. This 
type of primitives allow for rapid visualisation. To draw a fan of n triangles, for 
example, it is only necessary to pass n+2 vertices, instead of 3n^ to the graphics 
processor. This not only reduces computation time due to a reduction in vertices, 
but also an important memory savings. 

Obtaining the optimal set of strips or fans for a given surface is a process 
realised off-line when working with static models. However, when working with 
a multiresolution model, the surface connectivity changes with changes in level 

* Supported by grant TIC1999-0510-C02-02 (CICYT, Ministerio de Educacion y Cien- 
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Fig. 1. Example of a strip and a fan of triangles. On the left, a strip defined by 
'eO, 'L’l, 'L’2, v3, 'L’4, vb and on the right, a fan defined by t’O, 'el, 'e2, 'e3, 'e4, 'e5, 'e6, vl 





Fig. 2. Visualisation process of a LOD in VDPM 



of detail, probably with each frame, therefore requiring the dynamic generation 
of the strips or fans. 

This article presents a new multiresolution scheme permitting the recuper- 
ation of a level of detail, directly as a set of triangle fans. It is based on the 
Multiresolution Ordered Meshes (MOM) model presented earlier [2]. The new 
scheme, called MOM-Fan^ defines a new data structure and a new traversal al- 
gorithm which optimises the triangle fans for visualisation at the LOD required. 

Notation. The geometry of a triangulated model, M, is denoted as a tupla 
{V,.F}, where V is a set of N positions Vi = Zi) G R^, 1 < i < V, and T 

is a set of triples {j, /c, /}, j, k,l specifying positions of triangles faces. 

2 Previous Work 

Hoppe [3] presents a multiresoution model, VDPM^ based on a hierarchical struc- 
ture of vertices built from a sequence of contractions of edges. The level of detail 
is determined from a series of criteria based on the view frustum, surface ori- 
entation, etc. Changes in these conditions trigger changes in the required LOD, 
and it is proposed that triangle strips be generated using a greedy algorithm 
once the component triangles are determined for that LOD (figure 2). 

El-Sana et al. [4] presents a data structure, Skip-Strips^ that maintains trian- 
gle strips even though the LOD may change. A Skip-Strip is built at run time, 
from the multiresolution model. At the same time the triangle strips from the 
original model are obtained. Each time the level of detail changes, the Skip-Strip 
structure is updated based on the required LOD, which permits the update of 
the strips and their subsequent visualisation (figure 3). As the strips are gen- 
erally quite short, it has been proposed that they be concatenated previous to 
visualisation. 

The proposal in this article simplifies the LOD visualisation scheme. The 
multiresolution model itself is encoded using triangle fans, and therefore it is 



Multiresolution Modelling Using Triangle Fans 433 




Fig. 3. Visualisation process of a LOD in Skip-Strips 




Fig. 4. Visualisation process of a LOD using MOM-Fan 



necessary to make some adjustments to each fan according to the LOD (fig- 
ure 4). Figure 9 illustrates three LODs of a MOM-Fan object, where the trian- 
gles composing each fan have been coloured alike and the boundaries have been 
highlighted. 

Other multiresolution models exist which do not make use of either triangle 
fans or strips; see [1] for a recent survey of these. 



2.1 Review of Multiresolution Ordered Meshes 

Multitresolution Ordered Meshes was presented with the idea of improving the 
interactive visualisation of complex polygonal surfaces. Later, it was extended to 
exploit frame-to-frame coherence [5]. This permitted the acceleration of LOD re- 
covery, while not affecting visualisation time. Finally, in [6] MOM was compared 
against Progressive Meshes [7]. 

Let M and M^ be the original and multiresolution meshes, respectively. M^ 
explicitly stores the vertices and the faces ^ utilised to represent any 
resolution: 

M^ = ( 1 ) 

To build M'^ with n levels of detail, we apply n— 1 iterations of a simplification 
method. Each simplification 5^,0 < i < n— 1, produces a new level of detail M^+i 
and may be represented by the tuple Si = {Vi^Fi^V( ^F[} where V and Fi are 
the sets of vertices and faces which are eliminated from Mi^ and and F' are 
the sets of vertices and faces which are added to Mi to gerenerate, finally, M^+i. 
Therefore, we may express the resulting object, M^+i, as: 



M,+i = (M,-{V,FJ) U 0<^<n-l 



( 2 ) 
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Given that stores all vertices and faces that can be used at any set level of 
detail, M'^ can be defined as: 



n— 1 

=[jMi, n>l (3) 

i=0 

From the equations 1 - 3, we derive that M'^ can be expressed as the initial 
mesh, M = Mq, plus all vertices and faces generated in each iteration of the 
simplification process: 



n—2 

V" = Vo u V' U VI U ... U v ;_2 = Vo u y VI (4) 

2=0 

n—2 

.F" = J-oUF'uFi'U...UF'_ 2 = J-oU y F' (5) 

2=0 

or also, as the mesh corresponding to the worst level of detail, M^-i, plus the 
vertices and faces eliminated in each iteration of the simplification process: 

n—2 

V" = Vo U Vi U ... U Vn-2 U Vn-1 = y Vi U Vn-l (6) 

2 = 0 
n-2 

= FqVJ ... U Fn-2 U Fn-l = (^ Fi U Fn-1 (7) 

2=0 

The basic idea of MOM is based on the expressions of the two previous 
equations. That is, store in ordered form the sequences of vertices and faces 
eliminated, Vi and 0 < z < n — 2, plus the vertices and faces corresponding 
to the worst level of detail Mn-i = {Vn-i,Fn-i}. Each stored face is identified 
by a value representing its position in the face sequence ordered according to 
equation 5 above. MOM-Fan is based on the same idea, the difference being 
that here we store and manipulate triangle fans instead of isolated triangles. In 
section 3 we show how to store the fans in the data structure, and in section 4 
how to recover those fans which form a given level of detail. 

3 Data Structure 

The data structure presented here is based on a list of lists similar to that 
described in [2]. The fundamental difference is the list which stores vertex se- 
quences, which substitutes the anterior list of faces. In figure 5 the data structure 
of the model is shown. The data structure is formed of three lists: 

— Vertex list {vertexList field). Stores the vertices of the mesh in an ordered 
fashion according to their elimination in the simplification process. Each 
represents the initial vertex of a fan and consists of its co-ordinates {coord 
field) and a pointer to the second vertex in the fan {secondF anV ertex field). 
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struct Vertex 
float coord [3] ; 
int secondFanVertex; 
end struct 

struct FanVertex 
int id; 

struct Vertex *v; 
end struct 



struct ControlLod 

int deletedVertices ; 
int generatedFaces; 
end struct 

struct MOM-Fan 

int nVertices, nFanVertices , nLods; 
struct Vertex vertexList [] ; 
struct FanVertex f anVertexList [] ; 
struct ControlLod controlLodList [] ; 
end struct 



Fig. 5. MOM-Fan Data Structure 



— Fan list {f anVertexList field). Stores the fans as vertex sequences (except 
the initial vertex, already stored in the vertex list). For each fan we store 
the vertices of the triangles which disappear when the common vertex is 
eliminated. Each stored vertex consists of a pointer to the vertex in the 
vertex list {v field) and an identifier (id field). The identifier references the 
face of the fan represented by that vertex. The fans are also ordered according 
to their elimination in the simplification process. 

— Control List (controlLodList field). For each LOD, this list stores the infor- 
mation necessary to recover the data pertaining to that LOD. 



3.1 Construction Process 

The process of constructing the model is divided into 2 steps. The first involves 
the tasks which must be repeated during each iteration of the simplification. The 
second includes a group of operations which finish building the multiresolution 
model M^. To simplify the explanation let us assume that each simplification 
realised eliminates only one vertex. In figure 6 we show the mesh to be utilised in 
explaining the construction, initialisation, the result of the first stage -eliminating 
vertices v4 and vl- and the result of the second stage. 

Initialisation. Before beginning this process it is necessary to initialise M'^ 
with Mo, as it is the first level of detail. For this it is necessary to update 
controlLodList with data from the mesh Mq, filling in the fields deletedVertices = 
0 and generatedFaces = \To\. The vertex list and fan list are empty. 

First stage. For each iteration in the simplification process the following 
tasks are realised: 

1. Store in f anVertexList the vertices which form the fan, except for the vertex 
eliminated (common vertex). 

2. Store in vertexList the eliminated vertex and place the pointer at the first 
of the remaining vertices of the fan. 

3. Update controlLodList with the deleted vertices and the total number of 
generated faces. 
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controlLodList vertexList 



deletedVertices 




1 


2 


coord 


v4 


vl 


v3 


vO 


v2 


v5 


v6 


v7 


v8 


generatedFaces 




13 


16 


secondFanVertex 


0 


7 


13 


18 


~ 


~ 


~ 


~ 


~ 



fanVertexList 




Fig. 6. Example of model construction. Shaded cells indicate initialisation. In 
dark grey, data added after the elimination of f4, and in light grey, data added 
after the elimination of fl, both in the first stage. In white, data added as a 
result of the second stage 



Second stage. To complete the data structure: 

1. Add the mesh containing the worst LOD, as a set of fans, in the same manner 
as in the first stage. The vertices that do not have an assigned fan are put 
at the end of the list. 

2. Update the pointers of the fan vertices so that they point to the vertices in 
the new vertex list. 

With this step we conclude the construction of the data structure, which we can 
generalise as being the elimination of more than one vertex in the simplification 
process similar to that in [2]. 

3.2 Storage Cost 

Assuming that the cost of storing a real, an integer and a pointer is one word, 
and that we store three lists, one for each data type in the model (see figure 5), 
the total storage cost will be 4\V^\ + 2|^^| + 2n words, being the size of 
fanVertexList. The order of \V^\ (see eq. 4) in the worst case is quadratic with 
regard to |Vo|. The best case appears when we use a simplification method based 
on vertex decimation [8], where = |Vo| and may reach = 2|Vo| in the 
case of simplification methods based on elimination of edges [9]. 

With regard to \ in the worst case, the cost also is of quadratic order with 
respect to If we use a simplification method based on vertex decimation 
and we assume that the number of faces around a given vertex to be an average 
of 6 and that jV^I \Ti\/2^ 0 < i < n — 1, we have \Tf \ ^ 3.5|.Fo|. With a method 
based on elimination of edges, ~ 5.5|.Fo|. 
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for each vertex v from DV and while there exist triangles to paint 
InterruptionFan <- true {force the a.l case} 

for each vertex vi of the fan associated with n, from the first to the penultimate 
if fanVertexList[ui].id < GF then {case a) paint Vi} 
if InterruptionFan then 

paint(v) {case a.l) paint the initial vertex} 

InterruptionFan <- false {do not repeat case a.l)} 
end if 
paint {vi ) 

NF <- NF-1 

else {case b) do not visualise vi} 
if no InterruptionFan then 

paint (uz) {paint Vi to close triangle} 

InterruptionFan <- true {force case a.l)} 
end if 
end if 
end for 

if no InterruptionFan then 

paint (uG {last vertex Vi from the list, to close fan} 
end if 
end for 



Fig. 7. Visualisation algorithm 



4 Algorithm for Visualising a LOD 

The visualisation algorithm of a LOD, which is shown in figure 7, requires the 
initial calculation of the number of triangles to visualise at that LOD. Afterward, 
it traverses the data structure recovering the triangle fans pertaining to the 
required LOD. 

Given a LOD k in the control list we store the number of vertices not per- 
taining to A:, DU, and the number of generated faces, GF. Based on these data 
it is trivial to obtain the number of faces, NF^ which pertain to k. 

DV= controlLodList [k] . deletedVertices ; 

GF= controlLodList [k] . generatedFaces ; 

NF= GF- (vertexList [DV] . secondVertexFan- DV) 

Using the example in figure 6, suppose we wish to visualise the best LOD 
(Mo). The fans to be generated, ordered by the appearance of vertices in the 
data structrure, are: (v4 vl v3 v5 v6 v8 v7 vl) which correspond to the faces (f2 
f3 f4 f8 f7 f6); (vl v2 vO v3) and (vl v7 v2), which correspond to the faces (fO 
fl) and (f5), respectively. The first vertex of the fan always corresponds to the 
eliminated vertex. The remaining vertices are stored in fan VertexList^ and are 
associated with the eliminated vertex. For each fan only the t first vertices of 
t + 1 vertices composing it are processed, because each of them represents one of 
the t faces represented whereas the last vertex serves only to complete the fan. 

However, it may occur that some vertices do not pertain to the required LOD. 
In the example, because flO and f9, implicitly associated to vl, should not be 
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visualised, there is a jump from vertex v3 to v7. To resolve these jumps without 
splitting fans, it is necessary to introduce the vertex vl between v3 and v7, 
thus producing two degenerate triangles. While the first fan is resolved without 
degenerate triangles (v4 vl v3 v5 v6 v8 v7 vl), the second fan (vl v2 vO v3 vl v7 
v2) includes two of them, (vl v3 vl) and (vl vl v7). Upon processing each of the 
vertices two things may occur: a) the vertex identifier indicates that the triangle 
should be painted, or b) the vertex identifier indicates that the triangle should 
not be painted. When the second case occurs, this signifies an interruption in 
the fan, and if this continues further along it is necessary to insert the initial 
vertex. Therefore, the first time that case a) is encountered after an interruption 
the insertion should be realised (case a.l in the algorithm). 

The computational cost of the algorithm to extract LOD k depends on the 
total number of vertices not eliminated, which is at most — /c, and the total 
number of vertices of the associated fans, which is at most — 2k. jV’^l and 
\T^\ are, in the worst case, of quadratic order with respect to |Vo| and \Tq\, 
respectively, but this case differs substantially from the normal case. With the 
method of simplification by elimination of vertices the cost is 0(8|V/c|) and with 
a method based on edge elimination, 0(13|V/c|). 



5 Results 

The experiments were realised utilising a Silicon Graphics RealityEngine 2, with 
a MIPS RIOOOO at 194 MHz and 256 Mb RAM. Coding of the model was in C+ + 
and utilised OpenGL as its graphics library. The simplification method used to 
construct the multiresolution representations is that proposed by Garland and 
Heckbert [9] based on contraction of edges. The meshes come from the Stanford 
University Computer Graphies Laboratory (http://www-graphics.stanford.edu/ 
data/3Dscanrep/) and Cyberware (http://www.cyberware.com/models/). 

In table 1 we summarise the characteristics and storage costs of the objects 
used in the experiments. For each of them we indicate the number of vertices 
and faces of the original model, and its storage cost assuming a structure based 
on a vertex list and a triangle list [10]. Also it is assumed that a word (integer, 
real, or pointer) carries a set cost of 4 bytes. With regard to the multiresolution 
ordered mesh (MOM) representation, we indicate the number of levels of detail, 
the number of faces and the total storage cost. Regarding the new representa- 
tion proposed in this article, we indicate the number of fanVertex and the total 
storage cost. It can be observed that the number of fan vertices stored in the 
new representation is higher than that of the faces in the MOM representation. 
However, given that the cost to store a fan vertex is less than for a face, the 
storage cost of the new list provides a memory savings of about 20% over the 
face list. The repercussion of this is that the total storage cost is reduced by 
an important amount, approximately 15%, due to the fact that the list which is 
reduced is the ’’heaviest” list in the model (compare the number of faces with 
the number of vertices and LODs). 
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Table 1. Characteristics and storage costs 







Original 






MOM 




MOM- Fan 




Vertices 


Faces 


MB 


Lods 


Faces 


MB 


Fan V. 


MB 


Cow 


2,905 


5,804 


0.100 


2,803 


14,982 


0.237 


17,852 


0.202 


Sphere 


15,315 


30,624 


0.526 


15,264 


83,486 


1.306 


98,793 


1.104 


Bunny 


34,835 


69,451 


1.193 


33,990 


182,192 


2.876 


216,759 


2.445 


Phone 


83,045 


165,963 


2.850 


81,668 


441,181 


6.940 


523,844 


5.887 


Isis 


187,871 


375,736 


6.450 


187,370 


993,559 15.667 


1,181,373 13.309 


Buda 


543,653 


1,085,636 18.646 


543,106 


2,754,083 43.957 


3,297,593 37.598 



In figures 8(a) and 8(d) the behavior of the new representation is shown, 
using the Bunny and Buda models (some views of them are shown in figures 10 
and 11). On the X-axis the level of detail is represented, where 0 is the poorest 
and 1 the best. On the Y-axis we show the time spent (in seconds) for the 
model to recover the data pertaining to a given LOD and to visualise them. The 
behavior is similar to that obtained with the earlier MOM scheme and one can 
observe the linear response with respect to the number of triangles of the LOD 
visualised. The improvement gained by the use of triangle fans is diminished 
somewhat by the slight increase in data recovery time due to a more complex 
algorithm, the short length of fans (an average of 3.3 triangles per fan), the 
degenerate triangles, and the overhead caused by executing, for each fan, the 
instructions glBegin and glEnd in the OpenGL implementation. In figures 8(b) 
and 8(e) we show the number of non-degenerate triangles and the number of 
degenerates (around 23% of the total triangles) sent to the graphic subsystem 
per LOD. In figure 8(c) and 8(f) we show the number of vertices sent per LOD 
in the MOM representation, and those sent in the new representation (about 
40% fewer). 



6 Conclusions and Future Work 

In this article we present a new multiresolution scheme which permits the storage 
and visualisation of the distinct levels of detail as triangle fans. The objective is 
double: to reduce the visualisation time and also the space (storage) cost. 

The experimental results show a reduction of about 15% in storage cost with 
respect to the previous MOM representation, upon which the new approximation 
is based. However, the behavior of the new model regarding its visualisation 
time, is similar to its ancestor. A short average fan length, the high percentage 
of degenerate triangles, and the necessity to adjust the fans to the required LOD 
in real-time contribute to produce overall results which do not suppose a global 
improvement in visualisation time. 

This work will proceed from this moment on, toward the utilisation of triangle 
strips which, in principle, will permit a higher average number of triangles per 
strip than has been obtained using fans. In this manner we expect the storage 
cost to be further improved, as well as visualisation times. 
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(a) Visualisation times 



(b) Degenerate and 
non-degenerate faces 



(c) Vertices sent in 
MOM and MOM-Fan 






(d) Visualisation times 



(e) Degenerate and 
non- degenerate faces 



(f) Vertices sent in 
MOM and MOM-Fan 



Fig. 8. Results: a), b) and c) Bunny model; d), e) and f) Buda model 
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Fig. 9. Three levels of detail of the Sphere model visualised using fans 
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Fig. 10. Four levels of detail of the Bunny model visualised using fans 
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Fig. 11. Two levels of detail of the Buda model visualised using fans 
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Abstract. We introduce the distance transform for surfaces in 3D im- 
ages, i.e., the distance transform where every voxel in the surface is 
labelled with its geodesic distance to the closest voxel on the border of 
the surface. Then, the distance transform is used to identify the set of 
centres of maximal geodesic discs in the surface. The centres of maximal 
geodesic discs can be used to give a compact representation of any sur- 
face. In particular, they can provide a useful representation of the surface 
skeleton of solid volume objects. 



1 Introduction 

In [6] , the constrained distance transform was introduced for computing the mini- 
mal path from any point in the image to a goal point, avoiding obstacles. This can 
be used to find optimal paths between obstacles (e.g., for robot navigation), mea- 
sure perimeters, and determine convex hulls. A special case of the constrained 
distance transform is the distance transform of line patterns (DTLP), introduced 
in [12] and further investigated in [9]. In DTLP, distance information is prop- 
agated along the line pattern, starting from the end points of the line pattern 
itself (the background is used as an obstacle to the propagation). In presence 
of meeting lines that convey different distance information into their common 
meeting point, the minimum distance or the maximum distance can be used 
to continue distance propagation on the outgoing line. By selecting the max- 
imum distance, the DTLP can be used, for instance, to find the longest path 
through a line pattern. The minimum distance can be used to guide hierarchical 
decomposition. 

Here we extend the DTLP to surfaces in 3D images. In case of concurrent 
distinct distance values, we use the minimum distance to continue propagation. 
Every voxel in the surface is labelled with the distance to the closest voxel on 
the border of the surface. This implies that only open surfaces, i.e., surfaces 
that do not enclose any background component, can be considered. The first 
step in the algorithm is to detect the border of the surface. Then, once the 
distance transform of the surface is available, we identify therein the centres 
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of the maximal geodesic discs (CMGDs). These, similarly to the centres of the 
maximal discs in the distance transform of a 3D object, can be used to provide a 
new representation of the surface. In fact, the CMGDs constitute a subset of the 
surface which, in turn, can be seen as the union of the maximal geodesic discs. 
The CMGDs can be suitably linked to each other to obtain a curve representation 
with as many components as there are components in the surface. If this is done, 
one can easily treat images containing more than one object component, without 
need of resorting to a preliminary connected component labelling. Moreover, 
connected curves can be traced, which is not possible for sparse voxels, and 
can be vectorized so saving memory occupation. Thus, the representation of 
the surface in terms of connected sets including the CMGDs is convenient. We 
remark that the new representation does not coincide with the skeleton of the 
surface. In fact, loops can be created which do not correspond to tunnels of the 
surface, so that the topology is not preserved. Nevertheless, it can be seen as a 
new linear shape descriptor of surfaces in 3D images, where the curves can be 
interpreted as the main symmetry axes of the surfaces. 

To extract the new representation, it is convenient that the border of the 
surface is defined analogously to the contour of a 2D plane pattern. The approach 
to extract the new representation can then be seen as analogous to the one to 
extract the skeleton of the pattern from its distance transform. The contour of a 
pattern is the set of pattern pixels having a neighbour in the background. These 
pixels belong to protruding contour arcs, i.e., have no neighbours internal in 
the pattern, or belong to arcs delimiting the pattern itself, i.e., have neighbours 
internal in the pattern. To have a similar definition of border in case of a surface, 
we should identify there curves and edges, respectively corresponding to the 
protruding contour arcs and the arcs delimiting the pattern. Curves should be 
prevented from propagating distance information, as their 2D counterpart, the 
protruding arcs, consist exclusively of centres of maximal discs through which no 
distance information can be further more propagated. To this purpose, all curves 
have to be preliminarily identified and temporarily removed from the surface. It 
is then possible to identify only the border voxels (edges) from which distance 
information can be propagated to the surface. The curves are added again after 
the distance transform is computed. The distance label of their voxels is set to 
the value of the voxels in the edges of the surface, as also curves belong to the 
border. 

The new representation can be applied to any open surface. In particular, it 
can be applied to the surface skeleton of a 3D solid object. This latter case is 
treated in this paper. 

2 Definitions and Notions 

In this paper, 3D bilevel images, i.e., volume images consisting of object and 
background, will be considered. Each voxel has three types of neighbours in 
its 3 X 3 X 3 neighbourhood: six voxels sharing a face, twelve voxels sharing 
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an edge, and eight voxels sharing a point with the central voxel. We will use 
26-connectedness for the object and 6-connectedness for the background. 

Each voxel in the object can be labelled with the distance to its closest 
background voxel. This labelling can be performed by computing the distance 
transform (DT) of the object, [2]. The distance depends on the chosen metric. 
Two simple metrics, commonly used, are the and the metrics, where the 
distance between two voxels, v and re, is equal to the number of steps in a minimal 
6- and 26-connected path, respectively, between v and w. The corresponding 
DTs, here called DT^ and ^ can be computed by propagating distance 

information from the background in one forward and one backward scan over 
the image. Each voxel is assigned the minimum of the label of the voxel itself 
and the labels of its already visited neighbours increased by one. Eor DT^ only 
face neighbours are considered. 

DT^ and DT‘^^ give rough approximations to the Euclidean distance. To 
have a better approximation a weighted distanee transform can be used. Eor the 
weighted DT, we use different weights, W 2 , and res, for the distance to a 
face, an edge, and a point neighbour, respectively. Computation of the DT using 
weights rei = 3, rc 2 = 4, and res = 5, has been shown to give a good approxima- 
tion to the Euclidean distance, [2]. The DT using rei = 3, re 2 = 4, and res = 5 
will here be denoted 3 — 4 — 5 DT. When computing the DT, the contribution 
given by each already visited neighbour of a voxel is its corresponding label in- 
creased by the relative weight. To be consistent, we use weights also for DT^ and 
. Eor w\ = 1, rc 2 = 2, and res = 3, while for ^ w\ = 1, rc 2 = 1, 
and res = 1- 

The distance label of a voxel v in the DT can be interpreted as the length 
of the radius of a ball, centred on the voxel and fully enclosed in the object. 
In the following, we will denote by v both the voxel and its associated distance 
label. A voxel is eentre of a maximal ball (CMB) if the corresponding ball is not 
completely covered by any other single ball in the object, [1]. This can be checked 
by a suitable label comparison. A voxel 'c is a CMB if all its neighbours n^, 
i = 1, . . . , 26, with proper weights Wj^ j = 1, 2, 3, satisfy 

Hi < V Wj. 

Eor any voxel r’, is the number of 26-connected object components in 



the 3 X 3 X 3- neighbour hood of v. N is the number of 6-connected background 
components, having r’ as a face- neighbour, in the 3 x 3 x 3- neighbour hood of v 
where the point neighbours are disregarded (in other words only 18 neighbours 
of V are taken into account). Eor a deeper discussion, see [7,10,4]. Here we use 
the algorithm introduced in [3] for computing N ^ and N‘^^. 

3 Computing the Distance Transform of Surfaces 

Before computing the DT of a surface, we should detect its border. In fact, the 
minimum distance will be propagated starting from it. The labels in the DT 
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will indicate the distances to the background along paths in the surface, passing 
through the border. Therefore, we set the border voxels to wi instead of 0. The 
same convention is used for the end points when computing the DTLP, [12,9]. 

Surface skeletons can consist of both surfaces and curves, depending on the 
shape of the original object and the used metric. When computing the DT of 
the surface skeleton, only the voxels in the edges should be allowed to propagate 
distance information to the surface. Curves do not border any surface portion 
and we should prevent that any propagation occurs from them. Thus, we detect 
and temporarily remove the curves, before detecting the edge voxels from which 
propagation can be done. After the DT has been computed, the curves are added 
and the distance label of their voxels is set to wi . 

Our method can be applied to any surface skeleton of a solid 3D object. We 
prefer a skeleton that has a number of desirable properties, as it is the case for 
the surface skeleton introduced in [ 11 ], which will be used through the rest of the 
paper. The surface skeleton is centred within the original object with respect to 
the metric, topologically equivalent to the original object, almost symmetric, 
and allows complete object recovery. This last property, unfortunately, implies 
that the skeleton can be two- voxel thick. Thus, before detecting the border of 
the surface skeleton, this has to be reduced to a one-voxel thick surface. To 
this purpose, we use the local operations introduced in [4] to obtain a one- 
voxel thick surface skeleton starting from the two- voxel thick set. Thinning is 
achieved in six scans of the image, one for every face direction. During each scan, 
only configurations that are two- voxel thick in the scanning direction are taken 
into account, see Fig. 1 . Voxels in these configurations are sequentially removed 
provided that their removal does not change the topology. 

The above thinning is, however, not enough to obtain a one- voxel thick sur- 
face in presence of L-shaped regions, see Fig. 2. To remove voxels in such regions 

another step is necessary, based on the use of an L-shaped configuration detector. 



During this step, only voxels having N ^ =1 (grey voxels in Fig. 2) and at least 
three face neighbours in the skeleton are considered. Among them, the voxels 

with two face neighbours in the skeleton, that are edge neighbours of each other 

28 

and have both N ^ = 1 (•) or have both N ^ 7 ^ 1 (o), are marked as candidate 
to removal. Removal of marked voxels is then done sequentially, provided that 
topology is not altered. The resulting set is shown to the right in Fig. 2. 

Once the one-voxel thick surface is obtained, the border is detected in two 

^g 

steps. In the first step, curve voxels are defined as voxels having N ^ = 1 
and > 2 (the case > 2 occurs at curve junctions), [7,10]. Since prop- 
agation of distance information should not occur from curves, all voxels with 
N f = 1 and > 2 are temporarily removed. Note that the tips of the curves, 
end points, are not detected by the previous criteria and hence remain in the 

surface skeleton. This does not create any problem, since they are voxels iso- 



lated from the rest of the surface. In the second step, voxels having Nf = 1 
and iV26 = 1 are detected as edge voxel. With reference to Fig. 2, grey voxels 

^g 

are voxels having N ^ =1 and = 1 , hence they would be identified as edge 
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i T ^ ^ / / 

top-down down-top right-left left-right back-front front-back 



Fig. 1. Two- voxel thick configurations in the six scanning directions. The current 
voxel of the surface skeleton is shown in grey, the other voxel in the skeleton in 
black and background voxels in white 




Fig. 2. Surface skeleton before, left, and after, right, removal of voxels in L- 



shaped configurations. Grey voxels satisfy N f =1. See text 



voxels. Removal of voxels in L-shaped configurations is indispensable to avoid 
that these voxels are wrongly classified. 

To compute the DT of a surface, we use, analogously to the “regular” DT 
case, the weights wi for the distance to a face neighbour, W 2 for the distance to 
an edge neighbour, and ics for the distance to a point neighbour. Edge voxels are 
labelled w\. Distance information is propagated from the edge over the surface. 
Each voxel in the surface is assigned the minimum of the label of the voxel itself 
and the labels of its already visited neighbours, also belonging to the surface, 
increased by the corresponding weight. The propagation is repeated, alternately 
in forward and backward fashion, until no more changes occur. The number of 
scans needed depends on the complexity of the surface. Eor discussion about the 
2D case, [8]. The number of iterations of forward and backward scans needed for 
the examples shown in Eigs. 5 and 6 are 6 and 2, respectively. 

4 Representing the Surface 

Any voxel in the DT of a surface can be seen as the centre of a geodesic disc 
with radius equal to the distance label of the voxel, and fully contained in the 
surface. A geodesic disc centred on a voxel v can be computed by applying the 
reverse distance transformation, [4] , using 'c as a seed and the background of the 
surface as an obstacle. In Eig. 3, top, a surface is shown that consists of planes 
that meet each other along segments in face, edge, and point directions. The 
geodesic discs shown in Eig. 3, bottom, are centred at the intersection of the 
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above segments and correspond to different DTs, DT^ ^ ^ and 3 — 4 — 5 DT. 

The radius of the disc is the same for all DTs. 





Fig. 3. Above, a surface consisting of four planes meeting each other along seg- 
ments in face, edge, and point directions. Below (from left to right), geodesic 
discs centred at the intersection of the above segments and with radius 25, using 
and 3 — 4 — 5 DT, respectively 



4.1 Centres of Maximal Geodesic Discs 

A centre of a maximal geodesic disc can be defined in the same way as the 
centre of a maximal disc for 2D objects and the centre of a maximal ball for 
3D objects, [1]. Thus, a centre of a maximal geodesic disc (CMGD) is a voxel 
whose corresponding geodesic disc is not completely covered by any other single 
geodesic disc. A voxel v in the DT is a CMGD if all its neighbouring voxels n^, 
z = 1, . . . , 26, in the surface, with the proper weights iCj, j = 1, 2, 3, satisfy 

Ui < V Wj. 

In Fig. 4, the sets of CMGDs for the surface shown in Fig. 3, top, are shown for 
DT^, and 3 — 4 — 5 DT, respectively. 

The union of the geodesic discs corresponding to the set of CMGDs is equiv- 
alent to the original surface. 



4.2 Connecting the Centres of Maximal Geodesic Discs 

To have a representation of the surface with the same number of components as 
those of the surface, the CMGDs should be linked to each other. This could be 
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Fig. 4. From left to right, the set of CMGDs for the DT of the surface shown in 
Fig. 3, top, using DT^ , and 3 — 4 — 5 DT, respectively 



done, e.g., by growing paths in the direction of the steepest gradient in the DT 
starting from each CMGD. 

Indeed, besides the CMGDs, also other voxels, here called saddle voxels and 
braneh point voxels^ are important to gain the connectedness of the final rep- 
resentation. Saddle voxels are defined as voxels having in the surface skeleton 
more than one component of neighbours with higher label, or more than one 
component of neighbours smaller label. A branch point voxel is defined as a 
voxel having more than two components of neighbours with the same label as 
the voxel itself. In all cases, we use 26-connectedness. 

The CMGDs, saddle voxels, and branch point voxels are called intrinsie. 
They are detected and marked in one scan of the DT, before starting path 
growing. During path growing, suitable voxels, here called indueed are detected 
and marked on the DT. The DT is repeatedly inspected until no more voxels 
are marked. For each already marked voxel, n, either intrinsic or induced, its 
neighbourhood is inspected. The voxel(s) n^, z = 1, . . . , 26, on the DT, that by 
using the proper weights iCj, j = 1, 2, 3, maximise the gradient 

grad(ni) = 

Wj 

are marked as belonging to a growing path. We denote the highest gradient 

gvadjYiQj^ . 

Due to the fact that we are growing paths on surfaces, where 18-connected- 
ness is possible, we can have paths in certain directions that alternately include 
one single voxel and a pair of edge connected voxels. This might create diverg- 
ing paths, as the steepest gradient could occur along non-connected, diverging 
directions from of the two voxels. We force, if possible, path growing to continue 
in only one direction, whenever two or more connected neighbours equally max- 
imise the gradient. Thus, if a voxel can propagate on a number of neighbours 
all with grad{ui) = gradmax^ and constituting a unique 26-connected compo- 
nent, we originate a single path by choosing, as the next voxel in the path, only 
one neighbour of the component of n^’s. 

In Figs. 5 and 6, two examples of the set of CMGDs and the corresponding 
connected curve representations are shown. We started from 3D solid objects. 
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a cylinder and a digital Euclidean ball, respectively, and computed the surface 
skeletons by using the algorithm introduced in [11]. These were reduced to one- 
voxel thick surfaces, as described in Section 3. The surface skeletons were then 
also simplified by pruning some non-significant peripheral curves by means of 
the algorithm introduced in [5]. The metric used is . 




Fig. 5. From left to right: A cylinder with radius 25, and height 50, the simplified 
surface skeleton, the set of CMGDs, and the connected curve representation 
obtained after path growing 




Fig. 6. From left to right: A digital Euclidean ball with radius 28, the simplified 
surface skeleton, the set of CMGDs, and the connected curve representation 
obtained after path growing 



In Fig. 5, right, it is possible to see paths alternately including one voxel and 
a pair of edge connected voxels. We also note that the curve representation is not 
topologically equivalent to the surface skeleton and, hence, to the object. In fact, 
a number of loops characterise the curve representation, that do not correspond 
to tunnels in the surface skeleton and the object. The curve representation is 
not the curve skeleton of the object. 

Note that it is generally preferable to remove the markers from the CMGDs 
with label wi (except for the case in which they belong to curves) before perform- 
ing path growing. In fact, these CMGDs might cause the growth of a number of 
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noisy paths. The marker has been removed from CMGDs with label wi in the 
examples shown in this paper. 

5 Conclusion 

We have introduced the distance transform of surfaces in 3D images and have 
used it to extract the centres of maximal geodesic discs. The centres of maxi- 
mal geodesic discs provide a new representation of a surface in general, and of 
the surface skeleton of 3D solid objects in particular. To compute the distance 
transform, we detect the border. This is done in two steps. First the curves are 
identified and temporarily removed, then the edges are identified and used for 
propagating distance information. 

The set of centres of maximal geodesic discs is connected by a path growing 
process, to obtain a richer representation of the surface consisting of a set of 
connected curves. The representation has the same number of object components 
as the surface, but might have loops. Hence the resulting representation is not 
topologically equivalent to the surface, (which is topologically equivalent to the 
original object when the surface is indeed the surface skeleton of the object). 

Future work will be to further investigate the surface representation here 
introduced and, in particular, to identify the border starting directly from two- 
voxel thick surfaces. This is important to make our algorithm more general. 
We will also investigate the distance transform for surfaces where we use the 
maximum distance in case of concurrent distinct distance values. This will need 
a more complicated algorithm but could give interesting results. 
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Abstract. In this work, we present an optimal solution to the fol- 
lowing problem: given a Freeman chain-code curve with n elements, 
and m points of it, find the minimum envelope of the curve by a set 
of line segments. This segments are obtained modifying the coordinates 
of these m points up to a distance h. The complexity of this algorithm is 
0(nh-\-mh^), and it needs a storage of Oimh) data. In addition, we pro- 
pose a greedy approximation algorithm that provides good results with 
lower complexity 0{nh) in the worst case, and memory requirements 
0(h). A pre-processing with 0(rnn) is also needed for both algorithms. 
Some experimental results are shown. 



1 Introduction 

Computing minimal enclosures for geometric curves is a fundamental geometric 
optimization problem. This problem arises in applications in which one wishes to 
find an approximation to a curve that is simpler in the sense of having a smaller 
number of elements. The main difference respect of the polygonization of digital 
curves is that we impose the enclosure restriction. The problem of computing 
minimum convex polygonal enclosures of a given number of sides for a convex 
polygon was efficiently solved by Aggarwal and Park [1]. They studied monotone 
matrix-searching techniques for this optimization problem. 

Related work has been done on finding a minimal enclosing triangle of a con- 
vex polygon [2] , a minimal enclosing k-gon [3] , a minimal enclosing equiangular 
k-gon [4] and minimum polygon enclosures with specified angles [5]. 

Several heuristics are available for “approximating” a contour by a set of 
line segments [6], [7] according to various criteria. All these methods are com- 
putationally expensive (usually between O(n^) and 0(n^)). Although most of 
these works try to produce good results, few attempts have been made so far to 
actually yield strictly optimal solutions under well-defined optimization criteria. 
In this work we present an optimal solution to the following problem: 

Given a sequence of n integer-coordinate points and assuming that the curve 
is represented using the Freeman chain-code, we describe a digital curve C as: 

* This work was supported by the Xunta de Galicia grant PGIDT99PXI20602B 
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C = {Pj = 

Where Pj-\-i is a neighbour of pj. The purpose of this work is finding another 
sequence of m points that we will call E = {dj^j = 1, ...m} where m < n. The 
digital curve related to E will be the one conformed by joining these points by 
means of segments. This curve is defined as a set of linear functions: E = {fk,k = 
1, ...m — 1} so that fk is the segment that joins dk and dk-\-i. The sequence of 
points E has to fulfill the following two conditions: 

1. Its associated closed digital curve E is envelope of C. 

2. Defining S{C) and S{E) as the number of inner points to C and to E re- 
spectively. The goal is to get is the minimum value for | S{C) — S{E) \ that 
is referred to as error area. 

As a starting point, we define as a subset of m representative points of C. 
This subset should offer the maximum possible information on the shape of C, 
so that its digital associated curve be as similar as possible to C. It can be 
proved [8] that the most characteristic points in C will be those that show the 
biggest curvature, which are called dominant points. However, the digital curve 
associated to E may intersect C, not fulfilling the first demanded condition 
to be an envelope. Therefore, we propose an algorithm that carries out slight 
modifications on the position of the points in E so that their associate digital 
curve can verify both conditions. 

In this work two different techniques are suggested to obtain the envelope. 
One of them estimates the best solution in terms of minimizing the error area; 
since that starting from a initial E set, the coordinates of their elements are 
modified until an associated curve is obtained with the smallest error area. The 
second is a greedy heuristic solution that although doesn’t secure the best result, 
finds a close estimate with less computational and memory requirements. 

The digital curves that we have used show the following restriction: the values 
of Xj are monotonous regarding j, that is to say, 0 < Xj+i — Xj < 1. In case that 
the digital curve doesn’t fulfill this restriction, it can be broken down in tracts 
that do fulfill it and each can be analyzed independently. In order to obtain the 
envelope, and to maintain this property of the digital curves, we have imposed 
a limitation, that is the only coordinate that could be modified in the position 
of the dominant points is its y coordinate. 

For the calculation of the dominant points of C we have used the Kankanhalli 
algorithm [9]. This means an improvement on the Teh-Chin algorithm [10] in 
which computational work is strongly reduced. Also, it maintains two important 
characteristics of the original algorithm: the fact that the entry parameters are 
not required, and the determination of the dominant points based on the local 
curvature. We have modified the algorithm allowing the selection of the dominant 
points based on their importance. This will allow us to select a smaller number of 
points knowing that these they will be the most significant in the curve. Anyway, 
the algorithms could be applied to another set of points. An alternative proposed 
by [11] use naive lines to obtain this points. 

Once obtained the first initial set E with an arbitrary number of dominant 
points of the curve, the following step, common to both proposals, is to obtain 
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information on the characteristics of the C curve among serial elements of E. 
The following section describes how to obtain that information. 

2 Obtaining Information of the Curve 

This preprocessing operates on the tract of C between two consecutive dominant 
points {di^ dij^i}cE that we will call di and d^ respectively. The analysis of each 
tract is independent of the rest of them. If we define as df the point {xi, yi k)^ 
the purpose of this step is to obtain, for each df ^ the points of tangency t^eC. 
These are defined as the points belonging to the intersection of C with the 
straight line that goes through element and that is tangent to C in the 
segment {di,dr}. A representation of these points can be seen in Figure 1. For a 
given and its corresponding tf^ by means of an extrapolation of the tangent 
straight line, the position of the associated element d^ can be obtained. Notice 
that the minimum value of k is zero, and the maximum is the one in which the 
corresponding point d^ belongs to C, or equivalently k' = 0. 

It is useful some structure like the one shown on Table 1 to store all the 
relevant information on the process. The values that are shown are those cor- 
responding to the situation given in Figure 1. This structure has a column for 
each possible value of k. 

In order to obtain this table, the entry data are the digital curve C evaluated 
in the segment {di^ dr} and, inside that interval, the element with the maximum 
value of y coordinate: Pmax = (xmaxi l/max)- This point can be obtained without 
additional complexity during the calculation of the dominant points. 



Table 1. Data structure associated to a tract of C 



LevelO Level 1 Level2 Levels Level4 


d? 4 


4 


df df 


t°l tj 


4 


tf tf 


dt dl 


dl 


dl dl 




Fig. 1. Representative points of a tract of C 
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The process is divided in two equivalent problems: obtain the tf in the interval 
[xi^Xmax] for different df values, and in the interval [xmaxjXr] for different 
values. The values of k and k' allowed will be those that verify respectively, the 
following conditions: ke[yi^ymax] and k' e[yr^ymax\- We should point out that, 
although the problem has been broken into two halves, it is only necessary one 
access to the points of C, since these two halves are disjointed. If the curve has 
more than a maximum value, then the first and the last of them have to be 
considered, being irrelevant in the calculations the intermediate points of C. 

An outline of the used algorithm is shown below. Since the resolution of both 
problems is equivalent, the shown algorithm corresponds to the resolution of the 
tract of C among di and Pmax that we call C' . The other part of the tract is 
solved in the same way going over curve C from dr to Pmax- 

Algorithm TANGENCY_POINTS 
input 

C': Interval of C between di and pmax 

output 

List of entries {df, tf} 

begin 

for each pjeC' 

aj slope of the straight line that links di with pj 

if aj is maximal 

else 

h evaluate -increment{dL^Pj) 

tf ^ pjMk > h 

end if 

end for 
end algorithm 

This algorithm is based on two properties [12]: 

1. Given a point pjeC and the slope aj of the straight line that joins it with di. 
If the value of this slope is higher than the one obtained for the previous 
elements with a lower abscissa, then none of these previous elements is tan- 
gential. Therefore, the considered point is a tangential point for all df . 

2. Given a point pjcC that doesn’t verify the previous condition, but with a y 
coordinate higher than the y coordinate of the element pj' that has given 
the maximum slope. Then, this point is tangential to all the df that have a 
value for k equal or higher than the one given for the Expression 1. 

h = — —{xj> - xi){xj - xi) (1) 

Through the outermost loop in the algorithm, the points of the digital curve 
inside the considered interval are considered, and we check whether they verify 
property 1. To carry out this, it is only necessary to store the maximum value 
of the slope of the processed points. Among all the points that do not verify it, 
it is only necessary to consider those with a value in their coordinate y higher 
than the one of the point which has given the maximum slope. For these points, 
equation 1 is used to obtain the value for k for which they become tangential. 
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There is a question that is not mentioned in the algorithm due to lack of 
space, that is checking that the rest of the entries tf with k > h are not associated 
to another tangential point. If it is noticed that any tf is assigned to another 
element pj> different from the one of the maximum slope (another tangential 
point exists) and it is necessary to evaluate again expression 1 between pj and 
the new element. 

Carrying out a similar process on the interval [xmax^Xr] one can obtain the 
position of the tangency points for the different df. Combining both results and 
extrapolating the values, all the df and d^ positions higher than Pmax can be 
obtained as shown in Figure 1. In this figure, the dashed lines link the points 
that are extrapolated. 

3 Obtaining the Optimal Envelope 

The problem lies in figuring out the sequence of d^ points whose associate digital 
curve F is a minimum envelope. This method makes use of an exhaustive search 
among all the possible envelopes to the curve. For each d^ element belonging to 
the tract, the neighbour of the previous tract is searched so that when joined, the 
associate curve has a minimum error area. An example can be seen in Figure 2, in 
which two serial tracts of C are represented that span between points {<ii, 0 ^ 2 , 
Point d^ can be joined with d^^ so that the segment is tangent to C. However, it 
can also be joined to d^ or the d^ satisfying the condition of being a envelope, 
and although locally it is not the optimum, it can be part of the global optimum. 

The number of combinations for joining serial dominant points would grow 
exponentially with the number of tracts unless the condition of minimizing the 
error area was demanded. Due to this, the search space in our algorithm decreases 
considerably and it is possible to enunciate the following property: for a certain 
dominant point d^ there is at least one neighbour d^_^ with which it can form a 
minimum envelope [12]. 




Fig. 2. Possible links between d^ and d 2 
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Each tract is limited by two elements {di,dij^i}eE and has associated a 
table with all the possible tangency points. This table is obtained with the 
method described in the previous section. Each level contains the following fields: 
area, link}. The field area stores the error area accumulated in that 
tract, and link indicates the element of the structure immediately previous to 
the one the current level is connected to. 

Algorithm OPTIMAL.ENVELOPE 

input 

{C ,E}: Digital curve and dominant point set 

output 

E: Enclosed polygon points 

begin 

for z=l,m-l 

tangency[i] tangency -point s{di^diJ^\^pmax^C) 

for each level e tangency[i\ 

local -area <= evaluateJocal-area{pi ^p^ ) 

S add -Candidates {tang ency[i — 1]) 

if ^ 0 

level.link select -minimun jar ea{S) 

else 

level.link add jnew -entry {tang ency[i — 1]) 

end if 

level. area local jar ea-\- add jaccum jar ea{lev el. link) 

end for 

end for 

take-minimun-envelope{tangency[i]) m — 1 > i > 0 

end algorithm 

The OE algorithm is shown above. It successively processes the different 

tracts of C. For each one, the algorithm obtains the table with the possible 

tangency points, storing them in the structure called tangency. Next, it goes 
over all the levels of the generated table. For each level of the table the function 
evaluate -local jar ea{) is executed. This function obtains the area of the surface 
embraced by the tangent segment to C regarding a coordinate of reference yref- 
The value of this area is proportional to the error area, so that the entry that 
obtains the lowest value will represent the tangential tract that minimizes the 
area of error of the interval. In Figure 2, the tangent segment to C associated 
to point d^ is the one that joins it to d\. The local area appears shadowed in 
the figure for that situation. Notice that this calculation is independent of the 
element to which is linked. 

For each d^ it is necessary to calculate the set S of levels of the previous table 
with which it can be joined. This is carried out by function add-candidates{) . 
For example, in Figure 2, the group associated to d^ would he S = {d^^d^^d^}. 
For each one of the possible candidates the accumulated area is evaluated. For 
the union d^ — d\ the area will be the one that this last point has accumulated 
plus the local one to d^. However, for the union d^ — d^ the area of the triangle 
formed by points d^ — d\ — d 2 has to be added to the above-mentioned one. 
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Fig. 3. New level in d 2 due to 

Of all the possibilities of union, the one that obtains the smallest accumulated 
area is selected since it minimizes the area of global error of that level. Applying 
this procedure iteratively, it is sure that, when the possible values for d^ are 
analyzed, the accumulated areas of all the dj with j < i will be the minimum. 

A particular situation that is necessary to consider is when there is no element 
of the previous table that can be joined to the entry under consideration. This 
is the case, in Figure 3, of point d^. In this situation, and in order to maintain 
the coherence of the information, by means of add jnew .entry () a new entry is 
created in the previous table that contains the new value of d^_-^ with which 
it can be joined. The rest of the fields of this new entry are the same that of 
the entry d^zl immediately lower. In the figure, the entry associated to 

the point d^ would be created. The only exception is the accumulated error 
area, since it is necessary to bear in mind the increment in area in a similar 
way to how we did in the previous section. Again, in the figure it would be 
necessary to add to the accumulated area associated to point the area of 
triangle d\ — d^ — . An important property is that the link of this new entry 

is the same that the one of the immediately lower entry. This is because the area 
difference accumulated among both entries (the triangle) is independent of the 
element to which is linked. As the union of the previous entry (d^ in the figure) 
is the one with the smallest accumulated area of all the possibilities, that will 
also be the case for the new entry. 

When all the intervals have been processed, only the entry of the last tract 
with the smallest accumulated area is taken. Next, we go back over the previous 
tracts and by means of the field /m/c, we obtain the sequence of points E whose 
associated digital curve has the smallest error area. 
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4 An Heuristic Method of Obtaining the Envelope 

The method described in the previous section makes it certain obtaining a min- 
imum error area. However it has several inconveniences: the structures called 
tangency of each tract must be stored, all the entries of each one of them should 
be analyzed and it is necessary to go over them twice, one to obtain all the pos- 
sible solutions and the other one, in the opposite direction, to extract the best 
one. In this section we show a greedy heuristic method with much less compu- 
tational work, less demanding in terms of memory and that provides a solution 
close to the best one. 

The HE algorithm is shown below. The different intervals of C are processed 
consecutively calculating for each of them the tangency points. The difference 
regarding the previous method is that now a local reduction is intended in the 
error area. For each interval, only the central dominant point di, and the two 
immediate neighbours di-i y are considered. Next, an increase in the co- 
ordinate yi of di is conducted, this might imply a decrease of the y coordinate 
of their two neighbours as it is shown in Figure 4. 

Algorithm HEURISTIC_ENVELOPE 
input 

{C ,E}: Digital curve and dominant point set 

output 

E: Enclosed polygon points 

begin 

for 

tangency[i] tangencyjpoints{di,di-^i,pmax,C) 

Aerrorjarea evaluatejerrorjarea{yi -hi) 
while [Aerror -area < 0) 

yi^yi + 1 

Aerrorjarea evaluate-errorjarea{yi -h 1) 

end while 

end for 
end algorithm 



The increment experienced in the error area derives from the expression 
Aerror-area = (42 -h 43) — (40 + 41 -h 44), where the first term takes into 
account the gained error area and the second the lost one. For each modification 
in di starting from the tables tangency[i\ and tangency [i — 1] the value of the 
new positions of di-i and di-^i are obtained. Once known, we check whether a 
net reduction in the error area has taken place and we move on to evaluate the 
following increase in yi. It can be easily shown that the previous relationship 
can be rendered in a function that is dependent only on the coordinates x of the 
points [12]. Expression (2) is verified if only if Aerror-area < 0. 

{X0-Xa){x2-Xa){x0-X2) < {Xa-Xi)(Xa-Xo)(x0-X2)-{x3-X0)‘^{x2-Xa) ( 2 ) 

Where and xg correspond to the abscissas of the point of intersection of 
the segment with the initial position and the one that has the final position. 
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This expression allows us to determine, in a quick way, if there is a reduction 
in the error area or not. This process verifies the following property: when for 
some value yi the previous inequality can no longer be verified, for the remaining 
higher than values it will not be verified either [12]. 

Out of this property we can conclude that starting from the last di that 
verifies the inequality there will not be any other one that produces a smaller 
error area. This way the algorithm of local minimizing of the area should only 
evaluate the inequality for point yi~\-l. If this is verified, then y^ is incremented. 
This process will be repeated until the inequality no longer is satisfied. 

5 Results 

The computational complexity of the OE algorithm is 0{nh^mh‘^). The number 
of entries that are needed to store the information according to Table 1 is 0{mh). 
By contrast, in the case of the HE algorithm, the complexity is 0(n/i), and the 
amount of memory needed to store the information is 0{h). Several rugged curves 
were used as benchmark to prove and compare both algorithms. We use a curve 
of 1666 points to illustrate the results, and similar behaviours were obtained 
on other curves. A section of this curve with different number of points and 
its envelope is shown in Figure 5, notice the irregularity of its shape. A pre- 
processing with 0{mn) is also needed for both algorithms. 

Figure 6(a) shows the execution time consumed by both methods when they 
are run on a SUN Enterprise 250 computer. Notice that the reduction in runtime 
is important; in some cases is even less that half. Figure 6(b) shows the maximum 
amount of memory needed to store the tangency tables. The great improvement 
in this issue is clear. Finally, figures 6(c) and 6(d) show the comparison in terms 
of number of error pixels between both methods. The difference is important 
when m is low (40% with 3 points) but it fastly lows to 1% or less. 




Fig. 4. Effect of increasing yi 
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Fig. 5. Curve examples 
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Fig. 6. Efficiency of both approaches 
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Abstract. Simple cube-curves in a 3D orthogonal grid are polyhedrally 
bounded sets which model digitized curves or arcs in three-dimensional 
euclidean space. The length of such a simple digital curve is defined to 
be the length of the minimum-length polygonal curve fully contained 
and complete in the tube of this digital curve. A critical edge is a grid 
edge contained in three consecutive cubes of a simple cube-curve. This 
paper shows that critical edges are the only possible locations of vertices 
of the minimum-length polygonal curve fully contained and complete in 
the tube of this digital curve. 



1 Introduction 

The estimation of the length of a simple digital curve in three-dimensional eu- 
clidean space may be based on the calculation of the shortest polygonal curve in 
a polyhedrally bounded compact set [11,12]. This paper presents an analysis of 
possible locations of vertices of such a polygonal curve. This analysis has been 
used in [2] for the design of an iterative algorithm approximating such curves 
with measured time complexity in (9(n), where n denotes the number of grid 
cubes of the given digital curve. 



1.1 Digital Curves in 3D Space 

Any grid point (i, j, k) c 7^^, j, k integers, is assumed to be the center point of 
a grid cube with faces parallel to the coordinate planes, with edges of length 1, 
and vertices. Cells are either cubes, faces, edges or vertices. The intersection of 
two cells is either empty or a joint side of both cells. We consider a non-empty 
finite set K of cells such that for any cell in K it holds that any side of this cell 
is also in K. Such a set AT is a special finite euclidean complex [9]. Let dim(a) 
denote the dimension of a cell a, which is 0 for vertices, 1 for edges, 2 for faces 
and 3 for cubes. Then [AT, C, dim] is also a cell complex [5,7,9,13] with properties 
such as (i) C is transitive on AT, (ii) dim is monotone on K with respect to C, 
and (iii) for any pair of cells a^b ^ K with a (Z b and dim{a) + 1 < dim(b) there 
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exists a cell c G K with a G c G b. Cell b bounds cell a iff a C 6 , and 6 is a proper 
side of a in this case. Two cells a and b are ineident iff cell a bounds 6 , or cell b 
bounds a. 

We define digital eurves g in 3D space with respect to such a euclidean 
complex as special sequences {zq, zi, Zm) of cells where Zi is incident with Zi+i, 
and \dim{zi) — dim{zi-^i)\ = 1, for i 1 {mod m + 1). There are (at least) 
three different options which may depend upon an application context, or upon 
a preference of either a grid-point model or a cellular model which are dual 
approaches [4]. Let n > 1 . 

(i) An edge-eurve is a sequence g = (uq, cq, ui, ei, e^) of vertices Vi and 
edges e^, for 0 < i < n, such that vertices Vi and are sides of edge e^, for 
0 < z < n and = uq. It is simple iff each edge of g has exactly two bounding 
vertices in g. It follows that a vertex or edge is contained at most once in a 
simple edge curve. ^ 

(ii) A faee-eurve is a sequence g = (eg, /o, ei, /i, ..., e^, /n) of edges and 

faces /i, for 0 < i < n, such that edges and are sides of face for 

0 < z < n and e^+i = cq. It is simple iff n > 4, and for any two faces /i, fk in g 

with \i — k\ >2 {mod n + 1 ) it holds that if fiGfk 7 ^ 0 then \i — k\ = 2 {mod n + 1 ) 
and fi n //c is a vertex. 

(iii) A eube-eurve is a sequence g = (/o, cq, / i, ci, ..., /n, c^) of faces fi and 

cubes Q, for 0 < i < n, such that faces fi and /i+i are sides of cube q, for 

0 < i < n and /n+i = /o- It is simple iff n > 4, and for any two cubes q, Ck 
in g with \i — k\ > 2 {mod n + 1 ) it holds that if q fl c/c 7 ^ 0 then either 
\i — k\ =2 {mod n + 1) and q H Ck is an edge, or \i — k\ = 3 {mod n + 1) 
and Ci n c/c is a vertex. A tube g is the union of all cubes contained in a cube- 
curve ^f. It is a polyhedrally-bounded compact set in 7^^, and it is homeomorphic 
with a torus in case of a simple cube-curve.^ 

1.2 MLP in 3D Space 

This paper deals exclusively with simple cube-curves. The cube-curve on the left 
of Fig. 1 is simple, and the cube-curve on the right is not. The latter example 
shows that the polyhedrally-bounded compact set g of a cube-curve g is not 
necessarily homeomorphic with a torus if each cube of this cube-curve g has 
exactly two bounding faces in A (Jordan) curve is eomplete in g iff it has a 
non-empty intersection with any cube contained in g. 

Definition 1. A minimum- length polygon (MLP) of a simple eube-eurve g is 
a shortest polygonal simple eurve V whieh is eontained and eomplete in tube g. 

Following [11,12], the length of a simple cube-curve g is defined to be the length 
1{V) of an MLP of g. 

^ This definition is consistent with, e.g., the definition of a 4~curve in [10] (see propo- 
sition 2.3.3) for 2D grids where our edges are ‘hidden’ in a neighborhood definition, 
or of a closed simple path in [14] (see page 7) for undirected graphs. 

^ Closed simple one- dimensional grid continua [11,12] are defined such that each cube 
of g has exactly two bounding faces in g. 
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Fig. 1. Two cube-curves in 3D space 



A simple cube-curve g is flat iff the center points {i,j^k) of all cubes con- 
tained in g are in one plane parallel to one of the coordinate planes. A non- flat 
simple cube-curve in 71^ specifies exactly one minimum-length polygonal simple 
curve (MLP, minimum-length polygon) which is contained and complete in its 
tube [11]. The MLP is not uniquely specified in flat simple cube-curves. Flat 
simple cube-curves may be treated as square-curves in the plane, and square- 
curves in the plane are extensively studied, see, e.g. [6]. It seems there is no 
straightforward approach to extend known 2D algorithms to the 3D case. An 
important reason for that may be that 2D algorithms for (multigrid-convergent) 
perimeter estimation [6] may be such that all calculated vertices are grid points 
or vertices, but in the 3D case we are faced with a qualitatively new situation for 
the calculated vertices. The minimum- length polygon considered in this paper 
leads to vertices with real coordinates (not just multiples of integers as in the 2D 
case), i.e. the model of cell complexes is considered as being embedded into the 
euclidean space. However, independent upon the dimension global information 
has to be taken into account for length calculation of digital curves to ensure 
multigrid convergence. 

2 Simple Cube- Curves 

This section contains fundamental definitions and properties related to simple 
cube-curves. 



2.1 Non-contract ible Curves in g 

Let be a simple cube-curve, and V = (po,Pi,---, Pm) be a polygonal curve 
complete and contained in g, with po = Pm- 

Lemma 1. It holds m > 3, for any polygon V = (po,Pi, •••,Pm) complete and 
contained in a simple cube-curve. Two line segments alone eannot he eomplete 
in any simple eube-eurve. 
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Fig. 2. Curves complete and contained in a tube 



Proof. Cases m = 0 and m = 1 would be a point, and m = 2 would be a 
straight line segment. Both cases are excluded because our simple cube-curves 
are homeomorphic to the torus. The case m = 3 (a triangle) is possible, e.g. for 
the simple cube-curve shown in Fig. 2 . However, in this minimum case of m = 3 
it holds that no side of the triangle may be completely contained within one of 
the cubes. □ 

A curve contained in g is contractible (into a point) in g iff there are con- 
tinuously repeated topological (i.e. continuous and bijective) transformations of 
this curve into a family of disjoint curves, all contained in g, which converge 
towards a single point [8]. The curves on the left and on the right in Fig. 2 
are non- contractible in g, and the curve in the middle is contractible in g by 
continuous contractions into a single point. 

A Jordan curve 7 passes through a face / iff there are parameters ti,t2,T 
such that {j{t) : ti < t < ^2} ^ and 7(4 — e) ^ /, 7(^2 + e) ^ /, for all 
£ with 0 < £ < T. During a traversal along curve 7 we enter a cube c at point 
7(^1) e c if 7(4 — e) ^ c, and we leave c at point 7(^2) G c if 7(^2 + ^ c, for all 

5 with 0 < £ < T. A traversal is defined by the starting vertex po of the curve 
and the given orientation. 

We consider polygonal curves V. Let C'p = (cq,ci, be the sequence of 

cubes in the order how they are entered during curve traversal. If V is complete 
and contained in a tube g then it follows that C'p contains all cubes of and 
there are no further cubes in Cp. 

Lemma 2. For an MLP V of a simple cube- curve g it holds that Cp contains 
each cube of g just once. 

Proof. Assume that V enters the same cube c of twice, say at point qi first 
and at point Q2 again. Both points may be on one face of c, see Fig. 2 on the left 
and on the right, or on different faces of c, see Fig. 2 middle. 

First consider the case that both entry points qi and q2 of c are on one face / 
of cubes c and c' . Assume the number of passes of V through / is odd. We 
insert points qi and q2 into V as new vertices which split the resulting polygonal 
curve into two polygonal chains, Vi = (^2, •••, ^1) and V2 = (^1, •••, ^2) such that 
the union of both is V. The length of Vi exceeds the length of the straight line 
segment qiq2, for i = 1,2. W.l.o.G. let Vi be the chain which does not pass 
through /. It follows that Vi is complete in g. Because the cube c is convex it 
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also contains the straight line segment qiq2- We replace the polygonal sequence 
^2 by i*e. we replace V by Q = (<?i, <72, •••7 <?i)- Curve Q is still complete 
and contained in g, but shorter than V which contradicts our assumption that 

V is an MLP of g. 

Now assume that the number of passes of V through / is even, it enters c 
at qi^ then it passes / and enters c' at ri, then it passes / again and en- 
ters c at then it passes / again and enters c' at V2- There may be a fur- 
ther even number of passes of V through / before the curve returns to q\. 
We insert points <71 , ri , <72 , ^2 into V as new vertices which split the resulting 
polygonal curve into four polygonal chains, V\ = (gi,...,ri), V2 = (^1, •••, <72), 
= (<?2, •••, '^2) and V2 = (^2, •••,<?!) such that the union of all four is V. It 
follows that 

Cvi C Cp3 V Cp3 c Cpi , 

and an analog conclusion for V2 and V^. W.l.o.g. let C Then we replace 
in V the polygonal chain V\ by the straight line segment q\ri which is in /. 
The length of V\ exceeds the length of the straight line segment qir\. Thus the 
resulting polygonal curve is still complete and contained in g, but shorter than 

V which also contradicts our assumption that V is an MLP of g. 

We consider the second case that both points q\ and q2 are on different faces 
of cube c, say q\ on face fi and q2 on face /2- Because q2 is a re-entry point to 
cube c there must be a point q^x in /2 where we leave c before entering c again 
at q2. If there is another re-entry point on face /2 then we are back to case one. 
It follows that V leaves c once and enters c once. Assume that /2 is also a face 
of cube d ^ c oi g.liV would not intersect the second face of d contained in g 
then we may replace the polygonal subsequence {qex^ •••7 <^2) (which is contained 
in d but not in /2) by the shorter straight line segment qexq2 which is contained 
in /2 and thus in c', i.e. the resulting polygonal curve would be shorter and still 
contained and complete in g. It follows that the curve V has to leave cube d 
through its second face contained in g. Tracing g around means that we arrive at 
the cube d' ^ c which is also incident with face fi , and we leave d' (and enter c) 
at a point which may be equal to <71, and we enter d' again through /i. Thus 

V contains two polygonal subsequences which are both contained and complete 

in g. This contradicts the shortest-length constraint. □ 



2.2 Iterative Modifications 

Now we consider a special transformation of polygonal curves. Let V = 
(Po,Pi, •••7 Pm) be a polygonal curve contained in a tube g. A polygonal curve Q 
is a ^-transform of P iff Q may be obtained from P by a finite number of steps, 
where each step is a replacement of a triple a, 6, c of vertices by a polygonal se- 
quence a, 61, ..., 6/c, c such that the polygonal sequence a, 61, ..., 6/c, c is contained 
in the same set of cubes of g as the polygonal sequence a, 6, c. The case /c = 0 
characterizes the deletion of vertex 6, the case k = 1 characterizes a move of 
vertex h within g, and cases k > 2 specify a replacement of two straight line 
segments by a sequence of /c -f 1 straight line segments, all contained in g. 
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Fig. 3. Critical edges of two cube-curves 



Lemma 3. Let V be a polygonal eurve eomplete and eontained in the tube g of 
a simple eube- eurve g sueh that C'p is without repetitions of eells. Then it holds 
that any ^-transform of V is also eomplete and eontained in g. 

Proof. By definition of the g-transform it follows that this curve is also contained 
in g. Because C'p is without repetitions of cells it holds that V traces g cell 
by cell, starting with one vertex in one cell and returning to the same vertex. 
From Lemma 1 we know that V has at least three vertices, i.e. at least three 
line segments, and that for the minimum case of m = 3 it holds that two line 
segments cannot be complete in g^ i.e. there is at least one cube not intersected 
by these two line segments. Thus a replacement of two line segments (within the 
same set of cells of g) cannot transform V into a curve contractible in g, i.e. the 
curve remains complete in g. □ 

3 Critical Edges 

An edge contained in a tube g is eritieal iff this edge is the intersection of three 
cubes contained in the cube-curve g. Figure 3 illustrates all critical edges of the 
cube-curves shown in Fig. 1. Note that simple cube-curves may only have edges 
contained in three cubes at most. For example, the cube-curve consisting of four 
cubes only (note: there is one edge contained in four cubes in this case) was 
excluded by the constraint n > 4. 

Theorem 1. Let g be a simple eube- eurve. Critieal edges are the only possible 
loeations of vertiees of a shortest polygonal simple eurve eontained and eomplete 
in tube g. 

Proof. We consider arbitrary (flat or non- fiat) simple cube-curves g, i.e. the MLP 
may not be uniquely defined. 

Let V = (po,Pi, - -iPm) be a shortest polygonal simple curve contained and 
complete in tube g, with po = Pm and m > 3. We consider w.l.o.g. the polygonal 
subsequence (po,Pi,P 2 ) of such a shortest polygonal simple curve contained and 
complete in tube g. We will show that pi is on a critical edge. According to 
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Lemma 2 we know that C'p is without repetitions, i.e. we may apply Lemma 3 
for this curve V and tube g. 

We can exclude the case that p\ is collinear with po and p2 , because p\ would 
be no vertex of a polygon in such a case. Three non-collinear points po, Pi, 
and p2 define a triangular region A(po,Pi,P2) in a plane 8 in 71 ^. The following 
considerations are all for geometric configurations within this plane 8. In this 
proof, a boundary point is a point on the boundary dg. 

At first we ask whether pi may be moved into a new point Pnew within the 
triangle A(po,Pi,P2) towards line segment poP2 such that a resulting polygonal 
subsequence (po, ■‘■iPnewi •••7P2) remains to be contained in g. This describes a 
g-transform of P, and the resulting curve would be complete and contained in 
g. It can be of shorter length if the intersection of an ^-neighborhood of pi with 
A(po,Pi,P 2) is in g, for e > 0 . It follows that such a move of pi is impossible, 
i.e. it follows that for any e > 0 there is at least one boundary point q in an 
^-neighborhood of pi and on one of the line segments popi or pip2^ avoiding such 
a move of po into the triangle A(po,Pi,P2)- It follows that pi itself is a boundary 
point. 

The situation of an ^o-neighborhood at point pi is illustrated in Fig. 4 . Angle 
a represents the region not in g. Angles f 3 and 7 are just inserted to mention 
that they may be zero, and their actual value is not important in the sequel. It 
holds a < 7 T because it is bounded by an inner angle of the triangle A(po,Pi,P2)- 

A boundary point may be a point within a face, or on an edge. Assume first 
that boundary point pi is within a face /. Plane 8 and face / either intersect in 
a straight line segment, or face / is contained in 8 . The straight line situation 
would contradict that a < tt in the £o-neighborhood at point pi, and f d 8 
would allow to move p\ into a new point Pnew within A(po,Pi,P2) towards line 
segment poP2 which contradicts our MLP assumption. 

There are three different possibilities for an edge contained in g: we call it 
an uncritical edge if it is only in one cube contained in ^f, it is an ineffective edge 
if it is in exactly two cubes contained in g, and it is a critical edge (as defined 
above) in case of three cubes. Point pi cannot be on an ineffective edge such 
that it is also not on a critical or uncritical edge, because this corresponds to 
the situation being within a face as discussed before. Point pi also cannot be on 
an uncritical edge such that it is also not on a critical edge. Figure 5 illustrates 
an intersection point q with an uncritical edge in plane 8 assuming that this 
edge is not coplanar with 8. The resulting angle a > it (region not in g in an 
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^-neighborhood of q) does not allow that pi is such a point. If the uncritical edge 
is in E then angle a would be equal to tt, what is excluded at pi as well. So there 
is only one option left. Point p\ has to be on a critical edge (in fact, the angle a 
is less than vr for such an edge). □ 

Note that this theorem also covers flat simple cube-curves with a straightfor- 
ward corollary about the only possible locations of MLP vertices within a simple 
square-curve in the plane (see Fig. 6): such vertices may be convex vertices of 
the inner frontier or concave vertices of the outer frontier only because these are 
the only vertices incident with three squares of a simple square-curve. 

4 Application of the Theorem 

Our algorithm [2] is based on the following physical model: Assume a rubber 
band is laid through the tube g. Letting it move freely it will contract to the 
MLP which is contained and complete in g (assumed the band is slippery enough 
to slide across the critical edges of the tube). The algorithm consists of two 
subprocesses: at first an initialization process defining a simple polygonal curve 
Vq contained and complete in the given tube g and such that Fpo contains 
each cube of g just once (see Lemma 2), and second an iterative process (a g- 
transform, see Lemma 3) where each completed run transforms Vt into Vt+i with 
^ t > 0. Thus the obtained polygonal curve is also complete 

and contained in g. This algorithm uses the fact that critical edges are the only 
possible locations of vertices of the desired polygonal curve. This allowed us to 
achieve linear running time. 



4.1 Initialization on Critical Edges 

We sketch the initialization procedure to illustrate the importance of the proved 
theorem. The initial polygonal curve will only connect vertices which are end 
points of consecutive critical edges. For curve initialization, we scan the given 
curve until the first pair (eo,ei) of consecutive critical edges is found which 
are not parallel or, if parallel, not in the same grid layer (see Fig. 1 (right) 
for a non-simple cube-curve showing that searching for a pair of non-coplanar 
edges would be insufficient in this case). For such a pair (eo,ei) we start with 
vertices (po,Pi), Po bounds eo and p\ bounds ei, specifying a line segment popi of 




Fig. 5. Intersection with an uncritical edge 
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Fig. 6. Convex vertices of inner and concave vertices of outer frontier of the tube 
of a simple square curve in the plane 



minimum length (note that such a pair (po^pi) is not always uniquely defined). 
This is the first line segment of the desired initial polygonal curve Vq. 

Now assume that pi-ipi is the last line segment on this curve Vo specified 
so far, and pi is a vertex which bounds e^. Then there is a uniquely specified 
vertex on the following critical edge such that PiPi-\-i is of minimum 
length. Length zero is possible with = pp, in this case we skip i.e. we 
do not increase the value of i. Note that this line segment PiPi-\-i will always 
be included in the given tube because the centers of all cubes between two 
consecutive critical edges are collinear. The process stops by connecting pn on 
edge 6n with po (note that it is possible that a minimum- distance criterion 
for this final step may actually prefer a line between pn and the second vertex 
bounding eo, i.e. not po). See Table 1 for a list of calculated vertices for the cube- 
curve on the left in Figs. 1 and 3. The first row lists all the critical edges shown 
in Fig. 3. The second row contains the vertices of the initial polygon shown in 
Fig. 7 (initialization = first run of the algorithm). For example, vertex b is on 
edge 2 and also on edge 3, so there is merely one column for (2/3) for these 
edges. 

This initialization process calculates a polygonal curve Vo which is always 
contained and complete in the given tube. Note that traversals following opposite 
orientations or starting at different critical edges may lead to different initial 
polygons. For example, a ‘counterclockwise’ scan of the cube-curve shown in 
Fig. 1 (left), starting at edge 1, selects edges 11 and 10 to be the first pair of 



Table 1. Calculated points on edges (‘D’ in stands for ‘deletion’, i.e. there is no 
polygon vertex on this edge anymore) 



critical edge 1 


L 2/3 


4 


5 


6/7 


8 


9 


10 


11 


12/13 


1st run (initialization) c 


i b 


c 


d 


e 


f 


g 


h 


i 


j 


2nd run c 


i b 


D 


D 


e 


D 


D 


h 


i 


j 
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Fig. 7. Curve initializations (‘clockwise’) 



consecutive critical edges, and the generated ’counterclockwise’ polygon would 
differ from the one shown in Fig. 7. Figure 8 shows a curve where the defined 
initialization does not return to the starting vertex. 

Initialization results are shown in Figs. 8 and 9. Note that in case of fiat 
cube-curves the process will fail to determine the specified first pair of critical 
edges, and in this case a 2D algorithm may be used to calculate the MLP of a 
corresponding square-curve. 

4.2 Non-grid-point Vertices of MLP’s 

This initialization procedure is followed by an iterative procedure [2] where we 
move pointers addressing three consecutive vertices of the (so far) calculated 
polygonal curve around the curve, until a completed run t H- 1 does only lead to 
an improvement which is below an a-priori threshold r i.e. l{Vt) — r < /(Pt+i)- 




Fig. 8. Initial polygon (dashed) and MLP. Initialization starts below on the left, 
and the final step of the initialization process would prefer the second vertex of 
the first edge if a shortest-distance criterion would be used only 
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Fig. 9. Initial polygon (dashed) and MLP. Critical edges are shown as short line 
segments. The rest of the tube is not shown 



Figures 8 and 9 show the initial polygon Vi dashed, and the solid line 
represents the final polygon. The short line segments are the critical edges of 
the given tube. For details of the algorithm see [2]. Note that MLP vertices may 
also be in “non-vertex” positions dividing a critical edge into two segments of 
non-zero (rational) length. 

5 Conclusions 

Length-estimators for digital curves in the plane are a well-studied subject. See, 
for example, [6], for references in this field. There are methods for length estima- 
tion in the digital plane which are provable convergent to the true length value 
for specific convex planar sets, assuming finer and finer grids in the plane. Such 
a correct convergence behavior is also supported by experimental evidence [6]. 
Methods showing multigrid convergence in the digital plane are, for example, the 
‘classical’ digital straight line approximation technique (DSS method), the grid- 
continua based minimum- length polygon method (GC-MLP method), see [11], 
and a minimum-length polygon approximation method based on so-called ap- 
proximation sausages (AS-MLP method), see [1] for this AS-MLP method. 

Length-estimators for digital curves in the 3D space may be designed by 
following 2D design principles. The discussed method in this paper expands 
the GC-MLP method, see [12]. Due to this fact it follows that the discussed 
method satisfies multigrid convergence for the special case of flat curves as the 
2D method does for planar curves. This might be sufficient theoretical evidence 
for the convergence behavior of the discussed curve length estimation method. 

The paper has specified an important geometric property of such minimum- 
length polygons in 3D space which has been used in [2] for designing an efficient 
algorithm for calculating such a polygon. The given theorem has been used 
in [2] but without proof and accompanying definitions. In this sense the paper 
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has provided fundamentals for the algorithm discussed in [2] . The given theorem 
might be also of interest in the context of 3D curves in general. 
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Abstract. In this work a new method to detect objects under scaled 
orthographic projections is shown. It also calculates the parameters of 
the transformations the object has suffered. The method is based on 
the use of the Generalized Hough Transform (GHT) that compares a 
template with a projected image. The computational requirements of 
the algorithm are reduced by restricting the transformation to the tem- 
plate edge points and using invariant information during the comparison 
process. This information is obtained from a precomputed table of the 
template that is directly transformed and compared with the image ta- 
ble. Moreover, a multiresolution design of the algorithm speeds-up the 
parameters calculation. 



1 Introduction 

Planar object recognition and three-dimensional (3D) pose estimation are the 
most important tasks in computer vision. Some applications related to these 
methods are object manipulation, autonomous vehicle driving, etc. 

Perspective projection is one of the most suitable models for real camera 
image formation. However, it introduces non-linear relations. Simpler camera 
models can be used [1] under more relaxed situations which produce a negligible 
error. When the size of the object is relatively small (in comparison to its depth) 
and placed near the camera optical axis, the orthoperspective projection (or, in 
general, the subgroup of affine transformations [2]) is a good approximation to 
the perspective projection [3]. Here, the scaled orthographic projection will be 
studied. This projection only incorporates the distance and the foreshortening 
effects, but it eliminates the non-linear dependencies. 

There are several methods to find the transformations between a planar tem- 
plate and an image where this template is included and viewed from an arbitrary 
position [4]. Selection of the most suitable method will be based on considera- 
tions such as the kind of shapes we are dealing with and the ability to cope with 
practical problems such as occlusion, noise, etc. 

A simple approach to object detection is to find, for every possible orientation 
and position, the template transformation that produces a better matching with 
the image shape. However, the search space can become overwhelmingly large. 
An efficient evaluate and subdivide search algorithm is carried out in [5] in order 
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to find the affine transformation that brings the larger number of model features 
close to image features. A variant of the Hausdorff distance is used as a similarity 
measure between the two point sets. 

Search space can be reduced if invariant features are used. Geometric invari- 
ants are shape descriptors that remain unchanged under geometric transforma- 
tions such as changing the viewpoint. Algebraic invariants are obtained using 
the classical results derived from perspective geometry of algebraic curves. The 
fundamental invariant in projective geometry is the cross-ratio (CR). Guo Lei [6] 
uses the CR to recognize 3D views of different polygons. B.S. Song et al. present 
in [7] a target recognition method based in CR that selects stable points for 
complex scenes. More general algebraic invariants can be derived from configu- 
rations of conics [8], points [9] and lines [10]. Besides the algebraic invariants, 
Rothwell et al. use a canonical frame, invariant to perspective transformations 
of the plane, to implement index functions that select models from a model 
database, as part of a recognition system [11]. 

Differential invariants can also be applied. In [12], affine invariants requiring 
one point correspondence and second order derivative, or requiring two points 
correspondences and first order derivative, are used to determine whether one 
curve may be a perspective projection of another one. 

Other methods based on Fourier Descriptors [13] are aimed to exploit view- 
point invariants, but they are not fully invariant in perspective transformations. 
Furthermore, they are not robust when occlusion appears. 

Several methods have been proposed to detect the 3D pose of a planar object 
based on the Hough transform (hashing methods). Using the HT we can take ad- 
vantage of its useful properties, like relative insensivity to noise, and robustness 
to occlusions. However, it needs high computational and storage requirements. 

In this paper we undertake a new approach to planar object detection based 
on the Generalized Hough Transform (GHT) that compares template and im- 
age information in order to calculate the transformations between the template 
and the corresponding image shape. Computational and storage requirements 
are greatly reduced by using invariant template and image information derived 
from the gradients and positions associated with the edge points. Invariant in- 
formation is stored in template and image tables which are compared during 
the detection process. Template (or model) table information will be directly 
modified during algorithm application without needing further computationally 
expensive shape processing. 

The rest of the paper is organized as follows. Next section introduces the 
mathematical expressions involved in an orthoperspective transformation. Sec- 
tion 3 presents the new method for planar shape detection and the expressions 
that allows us to speed-up the generation and comparison of the tables. In section 
4, several real experiments have been carried out in order to test the algorithm’s 
behaviour. Finally, in section 5 several related works are analyzed more deeply. 
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2 Scaled Orthographic Projection 

The necessary transformations to project a planar object into the image plane 
are presented here, where / is the focal distance of the camera lens, d is the 
distance between the focal point and the intersection of the object plane with 
the z — axis (the optical axis), and n being the normal vector to the object 
plane. The projection of this vector onto x — y and x — z planes allows us to 
determine the pan, r, and the tilt, 5^ angles, respectively. Thus, r is the angle 
between the projection of n onto the x — y plane and the x — axis. On the other 
hand, 8 is the angle between the projection of n onto the x — z plane and the 
z — axis. 

The relationship between an object point {xi^yi^Zi) and its corresponding 
image point (ui,Vi,f) can be expressed using the orthoperspective transforma- 
tion: 



Ui= f ■ 
Vi= f ■ 



Xi • cos8 • COST — yi • sinr 

d 

Xi • cos8 • sinr + yi • cost 
d 



( 1 ) 



3 Planar Object Detection 

The detection process must find the occurrence of a template in an image. Six 
different groups of parameters indicate the transformation to be applied to the 
template to generate the object in the image. 1) Scaling of the template in the 
object plane represented by matrix S. 2) Displacement along the object plane, dx 
and dy^ represented by matrix D. 3) Rotation in the object plane, /3, represented 
by matrix Ry. 4) Tilt angle around object plane x — axis^ 8, represented by ma- 
trix Rs‘ 5) Pan angle around object plane z — axis, r, represented by matrix R^. 
6) Scaling of the scaled orthographic projection, represented by matrix So- 
After application of the previous transformations to the template, the new 
coordinates of the template in the image plane are given by: 



{ui,ViY = So ■ Rt ■ Rs ■ R0 ■ D ■ S ■ {xi, yif (2) 

Note that the only transformation that introduces a distortion in the image 
object is the application of the tilt angle. The rest of transformations only change 
either the size (scaling), the orientation or the position. 



3.1 Invariant Transformation 

Recent work with the GHT [14] shows a new method to detect bidimensional 
shapes that uses invariant information to displacement and scaling. 

The edge points of the image are characterized by the parameters < x,y, 6 > 
where x and y are the coordinates of the points in a two-dimensional space and 
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Q is the angle of the gradient vector associated with this edge point. An angle, 
called difference angle is also defined. Its value indicates the positive difference 
between the angles of the edge point gradient vectors that will be paired. 

From this description we can derive a transformation from the original image 
that generates new invariant information for the displacement and the scaling, 
based on paired points. Thus, let pi and pj be two edge points, < Xi^yi^Oi >, 
< Xj , Pj , Oj > their associated information, and ^ the difference angle to generate 
the pairing. Then, the transformation T can be expressed as follows: 



'T{pi,Pj 




0 



Oj 

elsewhere 



(3) 



where 

aij = Tarctan^^^ — ^ Z 0^ (4) 

V ~ Xj J 

that is, aij is the positive angle formed by the line that joins pi and pj and the 
gradient vector angle of the point pi. 

The information generated by the application of the previous transformation 
is stored in a table in order to improve the detection process speed. The mul- 
tivalued characteristics of the previous transformation will be apparent during 
table building. Next, we show the contents of this table: 



Orientation table (OT), The information generated by the T transformation 
is contained by this bidimensional table. The aij and 6j values are stored 
in rows and columns, respectively. When a pairing with aij and Oj value is 
calculated, the OT[aij][6j] position is incremented. Because different pairings 
might coincide with the same ai and Oij values, the content of OT[aij][6j] 
will indicate how many of them have these values. The information stored 
in this table is invariant to scale and displacement. Note that a rotation r 
of an image in a plane causes a rotation of r columns of its OT . 



Then, we can use the OT table to compare invariant information of the 
template and the image under scaled orthographic projection. Before starting 
the generation of the OT tables, the template must be transformed as follows: 



{Ui, Vif =Rs- R/3- {Xi,yif (5) 

In this way, the projected template will be similar to the image shape except 
for a different scaling, displacement and orientation. Then the template and 
image OTs will be different by a rotation of r columns. This displacement can 
be calculated applying a matching process for both tables [14]. 

In general situation, where the values for the {5 ^(3) angles are unknown, a OT 
table for the template needs to be generated for each (6, (3) value. Each template 
table is compared with the OT table of the image and a value for r is calculated, 
see Fig. 3.1. The maximum voting will indicate the correct solution. 

The table generation and comparison for each 8 and (3 values may require a 
high computational complexity. However, several improvements have been car- 
ried out to obtain a good performance. 
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Fig. 1. Block-Diagram of the system 



3.2 Table Generation and Comparison 

The new OT table generation is accomplished by only using the edge points 
of the template shape. Normally, the number of edge points in the template is 
lower than in the image, where several different shapes can appear. This makes 
the computational complexity of our method to be lower than other algorithms 
using the image shape [15]. 

On the other hand, the OT table construction is based on both the calcula- 
tion of the gradient vectors of the edge points {6) and the value for the difference 
angles (a). This calculation can take a high computational time. In order to re- 
duce this time, we have studied the modifications in an angle value that arise 
under a scaled orthographic projection. This allows to generate a OT table for a 
concrete 5 and f3 value by directly transforming the original OT template table. 



Modification of the Gradient and Difference Angles 

Lemma 1. Let 6i be the gradient angle of the template edge point (x^, yi). Then, 
the transformed gradient veetor, Oti, after applieation of expression (5) is : 



0Ti = atan(tan(/l + 6i)/cos5) (6) 

An important consequence of the previous lemma is that if (x^, y^) and (xj , yj) 
are two paired points in the template using a pairing angle of the transformed 
points after applying expression (5) are also paired. 

Lemma 2. Let ai he the difference angle value between two paired points using a 
pairing angle of Then, the transformed difference angle, aTi, after application 
of expression (5) is: 



an = atan(tan(/l T Oi T af)! cos(^) — Bti 



( 7 ) 
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The demonstrations of the previous lemmas are straightforward, so we have 
omitted them due to space limitations. The two previous lemmas allow the cre- 
ation of the new OT tables for each 6 and f3 angles without needing the calcula- 
tion neither the projected edge points nor the new difference angle for the paired 
points. The OT tables are sparse, so only the positions with values different from 
zero are stored. This reduces the required storage and allows the implementation 
of a more efficient transformation and comparison process. 



3.3 Pose and Position Estimation 

A matching algorithm has to compare, by columns, the transformed template OT 
table - for each possible tilt, pan value - and the OT of the image for different 
shifting values in order to calculate the pose (tilt, pan and rotation angles). The 
scale and the displacement values are obtained, after eliminating the previous 
transformations in the contour of the tested image, in the same way as in [14]. 

4 Experimental Results 

As we showed in the previous sections, the detection process is based on the use 
of the gradient and difference angles associated with the edge points. Then, the 
accuracy of our method will be limited by the accuracy of the angle detection. 
Good initial values for the angles and edge points are obtained by applying a 
Canny operator. 

We have used different images in order to test the behaviour of the whole 
detection process. All the examples have been executed in a SGI Workstation 
with a R- 10000 processor at 225 MHz. The range of the tilt, rotation and pan 
angles for the experiments are (0°, 60°), (-30°, 30°), and (0° , 360°), respectively. 
Images are 8 bit greyscale with half PAL size (384x288 pixels). Model images 
are acquired in a fronto-parallel view. There is no need for any camera calibra- 
tion procedure. The pairing angles have been chosen taking into account the 
template’s shape [14], in a way that a significant subset of the contour points of 
the template are paired. For this purpose the gradient’s histogram of the shape 
must be studied. 

In addition, the algorithm uses a multipass approach to accelerate the pro- 
gram execution. Starting with a coarse estimation of the pose angles, successive 
steps focus in a narrower interval around the solution estimated in the previous 
step. Finally, the scale and displacement values are calculated. 

The accuracy of the detection process is checked by using the likelihood 
function stated in [16]. First, this function measures the similarity, attending to 
edge points localization and gradient angle values, between transformed template 
and image contours. A visual checking is also carried out by superimposing the 
projected template shape on the image one. 

First, we have applied the detection to B&W model images that have been 
projected using a warping process (Fig. 4). In this manner, we can know the 
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Table 1. Parameters of the warping deformation for the test images 



Template 


"^POINTS 


^POINTS 




Scale 


Likelihood (%) 


Moon 


801 


602 


(45,10,-20) 


0.90 


87.37 


Indalo 


1231 


1312 


(39,13,29) 


1.15 


83.58 


Clover 


568 


599 


(50,19,-29) 


1.20 


82.39 


Key 


622 


531 


(34,-28,13) 


0.92 


92.43 



exact tilt, rotation, pan and scale values which have been used and check the ac- 
curacy of the detection. This allows us to study the theoretical behaviour of our 
programs. The template properties and experiment configurations are showed in 
Table 1. The first and second columns indicate the number of points detected 
by the Canny operator in both the template and transformed template. The 
following two columns show the parameters of the projection of the template 
by a warping process. Finally, in the last column, the value of the maximum 
theoretical likelihood is expressed. This value is calculated comparing the pro- 
jected template with the warped image for the optimal pairing angles using the 
exact transformation parameters in pose and scale (the displacement has to be 
estimated). A 100% likelihood is not reached due to the discrete nature of the 
image transformation. 




Fig. 2. Warped images 



Table 2 summarizes the parameters used in program execution and the results 
obtained for the detection proposes, (^i, ^ 2 ) are the used pairing angles. When 
two pairing angles are used, the achieved accuracy is better in general. The 
((^, /I, r), scale and the likelihood for the detected parameters are depicted in 
the subsequent columns. Finally, the computational time for the multipass and 
sequential algorithm is shown in the last two columns of Table 2. An important 
speed-up is achieved for the multipass approach. 

Finally, several real images, obtained using a CCD camera, are presented in 
Fig. 3 in order to show the behaviour of the proposed algorithms in practical 
situations. The images in the left column are the templates used and the images 
in the right column shows a visual checking of the solution, by superimposing 
the projected template on the image. 








486 



Julian Ramos Cozar et al. 



Table 2. Results obtained for warped images 



Image (points) 


(6, 6) 


(5,/3,t) 


Scale 


Like. (%) 




Timeseg (s.) 


Mooni 


(180, -) 


(45,10,-20) 


0.90 


87.37 


0.20 


3.73 


Moou2 


(180, 90) 


(45,10,-20) 


0.90 


87.82 


0.48 


9.10 


Indaloi 


(180, -) 


(39,10,-32) 


1.15 


71.80 


0.51 


9.72 


Indalo 2 


(175, 125) 


(39,13,-29) 


1.15 


83.58 


0.96 


18.91 


C lover 1 


(180, -) 


(50,18,-30) 


1.20 


78.68 


0.14 


2.74 


Clover 2 


(45, 90) 


(50,25,-27) 


1.12 


63.74 


0.37 


7.13 


Keyi 


(180, -) 


(34,-30,11) 


0.92 


94.63 


0.12 


2.36 


Key2 


(180, 150) 


(33,-25,17) 


0.92 


88.37 


0.30 


5.63 



Table 3. Practical situation 



Image (points) 


(6, 6) 




Like. (%) 


Time (s.) 


Pliers (1803) 


(180, 135) 


(20,-19,-40) 


73.13 


1.41 


Jewish Harp (1483) 


(180, 45) 


(43,-24,8) 


81.27 


0.91 


Cutter (1650) 


(180, 135) 


(20, -6,9) 


73.46 


0.47 


Scissors (2561) 


(45, -) 


(12,35,4) 


80.47 


0.67 



Table 3 summarizes the more important configuration values. Now, for real 
images, two pairing angles, except for Scissors^ are necessary if a reasonable 
precision is looked for. The images were acquired from an unknown arbitrary 
point of view, so the detected pose in the second column is only for informative 
purposes. The values for the likelihood function reported by the algorithm show 
that good accuracy have been achieved, in comparison with the values reported 
for the warped images. Finally, the computational times are presented for the 
multipass algorithm. 

5 Related Works 

In this section, a brief review of another works that use the HT to estimate the 
3D orientation and position of a planar patch is presented. Others papers, using 
the HT for detecting 2D transformations are not covered here [17,18]. 

The method described in [19] claims to be the first one in using the HT 
for planar object 3D pose estimation. It consists in mapping the classical 2D 
Hough space into another Hough space of the same dimensions using only ge- 
ometrical considerations. The new space is related to the former through a 3D 
transformation that takes into account the orientation and location of an object. 
In this way, data and model are transformed in a common representation space, 
where a direct matching can be carried out by determining the pose of the actual 
object with respect to the known pose of the model. The method is based on 
the classical HT to detect straight lines, and hence it deals with rigid planar 
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Fig. 3. Practical situation 
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objects bounded by straight segments. Planar shapes bounded by curvilinear 
edges, polyhedra and scenes containing several objects can also be tackled, in an 
indirect and more complex fashion, but they are not addressed in the work pre- 
sented. The main limitation of this approach, as pointed by the authors, lies in 
the computational complexity of the matching process, which involves exploring 
a 6-dimensional space for pose estimation. 

A new version of the GHT, called PTIGHT [20], is proposed for detecting 
a perspectively transformed planar shape in a perspective image that is taken 
from an unknown viewpoint. In order to build a perspective reference table from 
a given template, all possible perspective shape images are derived by applying 
forward perspective transformation on the template for all viewing directions and 
positions. The overlapping of this information imposes two restrictions. First, the 
positions in the HT with votes greater than a threshold are considered potential 
candidates. This hypothesis must be verified by back-projecting the image shape 
over the template in order to find the best solution. The resolutions than can be 
achieved are poor, 5° for the tilt and pan angles in the reported results. Although 
the images tested there are not complex, the computational time is high. Also, 
a camera calibration is required in order to know the focal distance. 

The algorithm presented in [15] detects planar objects that are rotated a 
tilt and pan angle, and distorted by a perspective projection before applying a 
chain of 2D transformations. Our method reduce the computational complexity 
of this approach by using a less complex perspective model and restricting the 
transformation to be applied to the template table which, usually, has a lower 
number of entries than the image table. 

Finally, a technique dealing with parallel projected planar objects with ro- 
tational symmetry is presented by Yip [21]. The main contribution of the paper 
is that it provides a method to reduce the dimensionality of the HT space by 
breaking it down into several lower order ones. The problem is the shape form 
and projection model restrictions it imposes. 

6 Conclusions 

A new method to detect planar shapes under orthoperspective projection has 
been presented. The method copes with a situation in which the shape of the 
projected image presents a different displacement, orientation and scaling in re- 
lation to a template. The method is based on the GHT and generates invariant 
information, by using gradient information, that allows us to uncouple the pa- 
rameter calculation and, in this way, reduce computational complexity. A new 
table generation is accomplished by only using the edge points of the template 
shape. Generally, the number of edge points in the template is lower than in the 
image, making the computational complexity of our method to be lower than al- 
gorithms using the image shape. Important improvements have been introduced 
to save gradient angle calculation of the projected points. Several examples, that 
show the accuracy of the algorithm with real images, have also been presented. 
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Abstract. The convexity of a discrete region is a property used in nu- 
merous domains of computational imagery. We study its detection in the 
particular case of polyominoes. We present a first method, directly rely- 
ing on its definition. A second method, which is based on techniques for 
segmentation of curves in discrete lines, leads to a very simple algorithm 
whose correctness is proven. Correlatively, we obtain a characterisation of 
lower and upper convex hulls of a discrete line segment. Finally, we evoke 
some applications of these results to the problem of discrete tomography. 

Keywords: Discrete Convexity, Segmentation, Discrete line, Polyomi- 
noes, Discrete tomography 



1 Introduction 

Discrete convexity intervenes in numerous domains with regard to geometry and 
particularly to image processing [4] . It is an important property of plane figures 
which permits, for instance, methods for geometrical shapes regularisation. The 
notion of discrete convexity is strongly linked with the paradigm of discrete 
lines. This is underlined in this article where a simple and efficient algorithm for 
detection of polyomino convexity is presented. Polyominoes are objects in which 
any couple of cells may be linked through a path containing only horizontal and 
vertical moves (4-connectivity). After having checked the hv-convexity (cells of 
each column and each row are consecutive) of a polyomino P, convexity is checked 
on the curve points of the border characterising it. 

The proposed method uses a variant of the linear algorithm for segmentation 
of curves in straight lines given in [7,6]. The eventual non-convexity of P is 
detected by the algorithm, during its scanning of curves of the border of P; if 
the whole border is scanned, P is convex. The proof of this algorithm uses a 
result on convex hull of a discrete segment presented in this article. 



G. Borgefors, I. Nystrom, and G. Sanniti di Baja (Eds.): DGGI 2000, LNGS 1953, pp. 491-504, 2000. 
© Springer- Verlag Berlin Heidelberg 2000 



492 



Isabelle Debled-Rennesson et al. 



This technique and methods directly coming from definitions of discrete con- 
vexity have been used for the study of the reconstruction of 2-dimensional dis- 
crete sets from their horizontal and vertical projections. In particular, we studied 
the reconstruction of convex polyominoes. The approaches we use are presented 
at the end of this article. This work enters in the more general framework of 
discrete tomography whose applications are numerous, particularly in data com- 
pression, image processing, or still in medical imagery for medical diagnosis help 
in radiography. 

In the second section, we introduce miscellaneous definitions of discrete con- 
vexity. Then a first simple method, deduced from a definition, is proposed. In 
the following section, fundamental elements of discrete geometry are given so 
that we obtain a very efficient algorithm for convexity detection on hv-convex 
polyominoes. Then, we propose a use of these methods in discrete tomography. 
At last, a conclusion and further research prospects are given. 

2 Discrete Convexity 

Convexity is well defined in the continuous case but in the discrete one, several 
definitions exist. The studied discrete figures are finite 8-connected subsets of 
discrete points in the plane and are named discrete regions. 

In 1970, Slansky [18] defines a discrete region as convex if and only if there is 
a convex (Euclidean) region whose image (after digitizing) is this discrete region. 
This definition depends on the digitizing process used. 

On the other hand, Minsky and Papert [14] gave the following definition of 
the convexity of a discrete region R: R is convex if and only if there is no triplet 
of colinear discrete points (ci, C 2 , cs) such as ci and cs belong to R and C 2 belongs 
to the complementary of R. 

Then, Kim and Rosenfeld [12,13,11] have given several equivalent definitions. 
They have shown that a discrete region R is convex 

— on the one hand, if and only if its convex (Euclidean) hull does not contain 
any discrete point of the complementary of R. 

— on the other hand, if and only if it fulfils the area property i.e. if and 
only if, for all points pi and p2 of R, P(R,pl,p2) does not possess any point 
of the complementary of R, where P(R,a,b) represents the polygon whose 
edges are made by the segment ab and the edges of R (see Eig. 1). 

These last two definitions shall be used in the study of the convexity of 
polyominoes. We refer to Kim and Rosenfeld [12,13,11] for a comparison of their 
definitions with Minsky and Papert ’s one. 

A hv-convex polyomino is a polyomino whose cells of each column are con- 
secutive (v-convexity) as well as those of each row (h- convexity). 

Some immediate properties may be deduced from this definition: 

- a hv-convex polyomino is a discrete region ’’without hole”, 

- a convex polyomino is hv-convex. 

The convexity study of a polyomino so starts by checking its hv-convexity, this 
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Fig. 1. On the left, a non-convex discrete region. On the right, a convex discrete 
region 



is done through a simple scanning of each row [4]. In the following, we study 
the convexity of hv-convex polyominoes. It is clear that this study may there- 
fore be reduced to the convexity study of the limit or border of the hv-convex 
polyominoes (see Sect. 3 and 4). 

3 Direct Use of the Area Property to Detect the 
Convexity of a /i?;- Convex Polyomino 

Let T be a hv-convex polyomino, anchored at (/c, /), i.e., containing in its first 
column a cell at row k and in the last column a cell at row 1. We call left limit 
of T, and we note L, the set of cells with minimal column index of each line. We 
define in the same way the right limit R of T. We distinguish in L the higher part 
LI, included between the first line and the left anchorage /c, and the lower part 
L2, included between the line k and the last line. In the same way, we distinguish 
in R the parts R1 and R2 limited by the right anchorage I (see Fig. 2). 




Fig. 2. Limits of the polyomino T 



To check that T is convex, we only have to apply the area property on LI, 
L2, Rl, and R2. So T is convex if and only if, for every couple (M, N) of points 
of LI, L2, Rl, or R2, and every y included between the row indexes of M and N, 
there is no discrete point whose row index y is located between the segment MN 
(inclusive) and the polyomino T (exclusive). Let (xi^yi) be the coordinates of 
M, (x 2 , y 2 ) the coordinates of N. The coordinates of the point P of the segment 
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MN with row index yi < y < y2, are with x = xi + 

order that T be convex, it is necessary and sufficient that: 

i. if M and N belong to LI or to L2, the discrete point with coordinates ( [x] , 
located to the right of the point P, belongs to T, 

ii. if M and N belong to R1 or to R2, the discrete point with coordinates ( [xj , y), 
located to the left of the point P, belongs to T. 

If the hv-convex polyomino T is represented by a 0-1 matrix whose dimension 
is m>Kn, the computation of its limits is in 0(m + n), m and n being the numbers 
of rows and columns of T, respectively, and the tests on couples of points are in 
0(min(m^, n^)). Indeed, when m > n, it is permitted to exchange the roles of 
columns and rows. 

4 Use of Discrete Lines to Detect the Convexity 

Let us consider a hv-convex polyomino included in a minimal rectangle Rec 
whose size is m>Kn. Let ([A,A’], [B,B’], [C,C’], [D,D’]) be the intersection between 
the border of the polyomino and Rec (see Fig. 3). By considering the pixels of 
the border, the points A, A’, B, B’, C, C’, D and D’ delimit 8-connected curves 
of pixels which characterise the hv-convex polyomino. These 4 curves cl, c2, c3, 
and c4 are made of the points of the polyomino border being respectively located 
between A’ and B for cl, B’ and C for c2, C’ and D for c3 and between D’ and 
A for c4, respectively. 
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Fig. 3. A hv-convex polyomino, gray pixels represent the curves cl, c2, c3, and c4 



As it has been indicated in section 2, to check the convexity of a hv-convex 
polyomino, we only have to consider the border; curves cl, c2, c3, and c4 must 
therefore be studied. The principle consists in segmenting curves cl, c2, c3, and 
c4, the eventual non-convexity of a polyomino shall be detected very simply 
through this operation. 



Detection of the Discrete Convexity of Polyominoes 495 

In the following paragraph, the notion of discrete line [16,15,7] is recalled 
as well as some properties of these object types with, particularly, a new result 
on the construction of the convex hull of a discrete line segment. Moreover, an 
algorithm of discrete line segment recognition, with its use in the detection of 
convex polyominoes and its adaptation to the problem are presented. 

4.1 Discrete Lines 

The arithmetic definition of a discrete line was introduced by J.P. Reveilles 
[16,15,7]; 

Definition 1. A discrete line with a slope | with b ^ 0 and pgcd(a,h)= 1, 
with lower bound /i, arithmetieal thiekness uj, is the set of points (x, y) of 7Z^ 
whieh satisfies the double diophantian inequation 

ji< ax — by <ii-\-uo 
with all integer parameters. 

We note the preceding discrete line V{a,b, fi.uj). We are mostly interested 
in naive lines which verify uj = sup{\a\, |6|) (see Fig. 4), we shall note them 
V{a,b,fi). To simplify the writing, we shall suppose in the following that the 
slope coefficients verify 0 < a and 0 < 6, therefore u; = max(a^ b). Real straight 
lines ax — by = y et ax — by = jj, UJ — 1 are named the leaning lines of the 
discrete naive line V{a^ 5, y). An integer point of these lines is named a leaning 
point. 

The leaning line located above (resp. under) V in the first quadrant (0 < a 
and 0 < 6) respects the following equation ax — by = y (resp. ax — by = /i+o; — 1), 
it is named upper leaning line (resp. lower leaning line) of V. All points of 
V are located between these two lines at a distance strictly lower than 1. It is 
clear that every segment of a discrete line is a convex region. 




Fig. 4. A segment of the naiVe line D(5, 8, —1) for x G [0, 15], the upper leaning 
points are in dark grey and the lower leaning points are in light grey, leaning 
lines are dotted lines 



A naive line V may be seen as a set of the integer points taken on the union 
of real lines ax — by = c with c = y^ y 1 ,..., /i + a; — 1. We name o-levelled 
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dotted line of the line the set of integer points located on the real line 
ax — by = a. 

Definition 2. r{M) is named the remainder at point M(xM^yM) with re- 
spect to V and is defined by: 

r(M) = axM - byM. 

Proposition 1. Let MqMi be a segment of the line V{a^ 6, /i)^ the lower con- 
vex hull of the MqMi points is the polygonal eurve going through the following 
points: 

— The first and the last lower leaning point of the segment, named Lp et Lp- 

— Between Mq and Lp : the (Ni) sequenee with xmq ^ XNi ^ xp^ , Nq = Mq 

sueh that x^i < ci'^d r{Ni) < r(Nipi) < /i + a; — 1 

— Between M\ and Lp •* the {Pi) sequenee with xmi ^ xp. > xpj^, Pq = Mi 

sueh that xp. > and r{Pi) < r{Pipi) < /i + a; — 1. 

(see Fig. 5 for an example). 



Remark 1. 

1. Colinear points of the Ni and Pi sequences may simply be omitted in the 
characterisation of the lower convex hull. 

2. To construct the upper convex hull of a discrete line segment, the Ni and Pi 
sequences are determined in the same way from the upper leaning points. 



-4 




Fig. 5. Segment of the naive line T^(5, 8, —4), the lower convex hull of the segment 
is drawn, the value of the remainder is indicated for each point 
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Proof. All points of V are located above the lower leaning line, the segment 
points located between Lp and Lp are therefore above the segment LpLp and 
the points Lp and Lp belongs to the lower convex hull of the segment. 

Let us consider now the part of the segment between Mo and Lp. Between 
these two points, we find at most uj — 2 points, with all different remainders. 
Let Ni be a point of the sequence different of and Nipi the first point 
encountered since Ni such that Nipi belongs to the interval [r{Ni), /i + a; — 1], 
the segment points between Ni and Nipi are located above the dotted line going 
through Ni and therefore above the segment NiNipi. 

Let Ni, Nipi, Nip 2 , be three consecutive points of the sequence. Let us prove 
that Nip 2 is located above or on the straight line NiNipi, thus we shall have 
demonstrated the convexity of the sequence of points Ni. Two cases must be 
studied: 

Case l.lfr{Nipi)-r{Ni) < r{Lp) -r{Nipi) dind xp^ > (see Fig. 

6), the point N-, symmetrical point of Ni with respect to Nipi, is a point on 
the segment located between Nipi and Lp. Let us suppose that Nip 2 is a point 
located between the dotted lines of level r{Nipi) and r(A’') such that xn._^^ < 
XNi +2 < xn'.‘ The symmetrical point ^"'^2 with respect to Nipi is a 

point of the segment whose remainder is included between r{Ni) and r{Nipi) 
and whose x-coordinate is included between those of Ni and Nipi, which is 
contradictory with the hypotheses on Ni and Nipi. Necessarily Nip 2 = N^. 
Case r{Nipi) —r{Ni) < r{Lp) —r{Nipi) and xp^ < 2 xni^i ~^Ni may not occur 
(property related to Klein Theorem and discrete lines [6]). 

Case 2. If r{Nipi) —r{Ni) > r{Lp) —r{Nipi), Nip 2 is, by construction, a point 
located between the dotted lines of level r(Nipi) and r{Lp) such that < 

XNi +2 ^ ^Lf' Let us suppose that Nip 2 is located under the NiNipi line, by using 
the symmetrical point of Nip 2 with respect to Nipi we also get a contradiction. 
A similar reasoning may be applied to the sequence (Pi) between Mi and Lp. 

□ 




Fig. 6. Figure illustrating proof of Proposition 1 (Case 1) 



498 



Isabelle Debled-Rennesson et al. 



4.2 Discrete Line Segment Recognition 

Let us consider Z’(Mo,Mi) a segment of D, discrete naive line with charac- 
teristics a, 6, /i, 0 < a and 0 < 6, Mq and Mi are respectively the first and 
the last point of the segment. Let us suppose that the point M{xm,Vm) (with 
M — Ml + (1, 0), M — Ml -h (1, 1) or M = Mi + (0, 1) ) is added to E. Is 
E' = EU {M} a straight line segment and, if it is the case, what are its charac- 
teristics a', 6', /i' ? 

This problem is solved in [7,6] and relies on some particular leaning points of a 
discrete line. They are the leaning points located at the extremities of the seg- 
ment currently being recognised. We note Up (resp. Lp) the upper (resp. lower) 
leaning point whose x-coordinate is minimal. In the same way, we note Up (resp. 
Lp) the upper (resp. lower) leaning point whose x-coordinate is maximal. 

Theorem 1 ( [7,6]). Let us consider r{M), the remainder at point M {x m i y m) 
with respect to V (r{M) = axM ~ )• 

(i) df jj, < r{M) < /j, cj, then M G V{a^ E VJ {M} is a segment of 
the straight line V. 

{ii) If r{M) = /i — 1, then E U {M} is a segment of the straight line whose 
slope is given by the vector UpM. 

(in) If r{M) = y ^ w, then E U {M} is a segment of the straight line whose 
slope is given by the vector LpM. 

{iv) If r{M) < y — 1 or r{M) > y-\- ou, then E U {M} is not a segment of a 
discrete line. 

Remark 2. In the case (iv) M is called strongly exterior to V and: 

- If r(M) < y — 1 (M is above Z’), there is an integer point between UpM 
and E. 

- If r(M) > y (jj (M is under Z), there is an integer point between LpM 
and Z. 

The existence of these points is demonstrated at page 644 of [7]. 

Theorem 1 allows us to obtain an incremental algorithm (see Fig. 7) of 
discrete line segments recognition by scanning a sequence of 8-connected pix- 
els named discrete path. A linear segmentation algorithm for 8-connected 
curves [7,6] is immediately deduced from this result by considering the longest 
segments, last point of a segment being the first one of the next segment. To 
detect the convexity of a polyomino, we use a variant of this algorithm which is 
presented in the following section. 

4.3 Use of a Segmentation Algorithm for the Detection of Convex 
Polyominoes 

Detecting the convexity of a hv-convex polyomino P (see Fig. 3) consists in 
studying its convexity at the neighbourhood of each section of its borders cl, 
c2, c3, and c4. In this whole paragraph, convexity is described for the discrete 
curve cl, detection for the other curves is deduced by symmetry. 



Detection of the Discrete Convexity of Polyominoes 



499 



Algorithm AddPoint 

remainder = axM — byivi ; 

If IJL < remainder < uj 

If remainder = /x then Ul = M Endif 
If remainder = /x + a; — 1 then Ll = M Endif 

else 



If remainder = /x — 1 then 
Lf = Ll ; 

Ul = M ; 

a = Iym - yup I ; 

b = |XM - xup I ; 
fjL = axM - byM ; 

else 

If remainder = /x + a; then 
Uf = Ul ; 

Ll = M ; 
a = Iym - yip I ; 

b = |XM - XLp I ; 

/X = axM - byM - sup(|a|, |b|) + 1 ; 

else 

the new point may not be added to the segment 

Endif 

Endif 



Endif 



Fig. 7. Algorithm adding a point M to the front extremity of a segment of a 
naive line V{a^ b, /i) in the first quadrant 



A curve is said lower convex if there is no discrete point between itself and its 
lower convex hull. 

We must therefore prove that cl is lower convex. 

The algorithm for discrete line segments recognition is used on cl, it scans 
points of cl one after another and stops when the added point may not belong to 
a discrete line segment containing all points already scanned plus the one added. 
The recognition shall not continue for a new segment at this added point but at 
the last lower leaning point of the segment which has just been recognised. When 
such a reject forces a segment change, the non convexity may be detected. The 
rejected point is located strongly above or strongly under the segment. Thanks 
to Remark 2, there is, in the second case, an integer point located between the 
discrete segment already scanned and the real segment from M to the first lower 
leaning point. This segment to which M is added is therefore not lower convex 
and so neither is cl. In this case, the algorithm stops on a non-convexity state- 
ment of the polyomino. 

Let us consider the procedure Recognizesegment(Mo) whose input is a point Mq 
of the curve cl and which then adds the next points of cl from Mq as long as 
the scanned set of points is a discrete line segment. This procedure outputs M, 
the last point tested, which is either the rejected point, or the last point of cl, 
the characteristics (a, b, /i) of the scanned segment with a and b positive, and the 
last lower leaning point Ll as well as a boolean variable end, equal to false while 
cl has still not been completely scanned. 
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Definition 3. Let S be a diserete line segment obtained by the proeedure Rec- 
ognizesegment^ we eall reduced segment of S and we note S^, the segment eon- 
taining the points of S loeated before the last lower leaning point existing in S, 
inelusive of this point. 



Algorithm SegConv 

Input : cl, 8-connected sequence of points of the first quadrant 
convex = true 
end = false 

firstpoint= first point of cl 
While convex and not end Do 

Recognizesegment(firstpoint)^ (M, a, b, /i, Lt.end) 

If axM — byM > /X + max(a, b) then 
CO nvex= false 

Else 

firstpoint=LL 

Endlf 

EndWhile 

Fig. 8. Algorithm testing convexity of the part of a polyomino in the first quad- 
rant 



Theorem 2. A eurve el of the first quadrant is lower eonvex if and only if the 
SegConv algorithm eompletely seans el (eonvex=true at the end). 

Proof. Let us consider a point M which may not be added to the current segment 
with characteristics a, 5, /i such that axM — byM > /i + max(a, 6); then, according 
to Remark 2, there is an integer point between the current discrete segment and 
the straight line segment LpM] therefore, according to the area property, cl 
is not convex. In other words, when the algorithm stops because the variable 
convex has been set to false, cl is not convex. 

Let us suppose that cl is completely scanned by the algorithm SegConv (see 
Fig. 8), then, all changes of straight line segments have been done on points M 
which verify axM ~ byM < jx — 1. Each point is therefore located above the 
segment previously scanned. 

Let SI and S2 be two segments successively recognised by SegConv during the 
scanning of cl. SI characteristics are ai,6i,/ii and its slope is ai (=fy)- Let us 
consider Lil the last lower leaning point of SI and M the rejected point which 
does not belong to SI. 

Let ST be the reduced segment of SI. Let us prove that the last edge of the 
convex hull of ST has a slope which is lower than the one of the first edge of the 
convex hull of S2. 

As ST ends at the lower leaning point Lil then, according to Proposition 1, if 
at least two lower leaning points are present in the segment, the slope of the last 
edge of the convex hull of ST is a\ otherwise, the slope is lower than a\. 

Let L\lK be the first edge of the convex hull of S2, whose slope is /3, we must 
consider two cases: 

- UK G SI, necessarily K is located above the lower leaning line of SI therefore 
(3 > a\. 
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— If iC ^ SI, let us suppose that /3 < ai, i.e., that K is under, or on, the lower 
leaning line of SI (see Fig. 9). The points Li^, M, and K belong to the 
segment S2. However that is impossible because the distance between M and 
the leaning line of SI is greater than 1, and so fortiori^ the one between M 
and L\lK. Therefore these three points may not belong to the same discrete 
line segment and (3 > a\. 



M 




Fig. 9. Figure illustrating proof of Theorem 2 



This property implies that the lower convex hull of the union of S’l and S’2 is 
the union of the lower convex hulls of these both segments. As a generalisation, 
the lower convex hull of cl is the union of convex hulls of the S’i. However each 
discrete segment is, by definition, lower convex. Therefore cl is also lower convex, 
according to the definition of convexity by Kim and Rosenfeld (see Sect. 2) based 
on the convex hull of the set. □ 

Complexity of the construction of curves q, i = 1..4, is in 0(m + n), m 
and n being the numbers of rows and columns of the polyomino. Let us say a 
few words about the complexity of the SegConv algorithm. We first notice that 
the algorithm actually terminates. Indeed, at each step, the variable firstpoint 
is incremented by at least one element (in a non- void segment, is always 
different of the first element). On the other hand, the segmentation algorithm 
described in [7,6] is linear in the number of points of the curve to scan. However, 
the variant we propose obliges to re-start every time at the last lower leaning 
point encountered, that leads to some overlaps. This algorithm is actually linear 
despite its overlaps. The proof of its linear complexity will be published in a 
next article. 

5 Applications to Discrete Tomography 

The objectives of discrete tomography is the reconstruction of a finite set of 
points from a number of projections. Several authors have been interested in 
this subject by using miscellaneous approaches to obtain exact, approached, or 
random solutions [9]. 
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The particular case of polyominoes reconstruction from their horizontal and 
vertical projections is a NP-complete problem [19]. Moreover, if we suppose h- 
convexity (or v-convexity) , the problems remains NP-complete [2] . On the other 
hand, if we consider hv-convex polyominoes, we get today several polynomial 
algorithms to decide if there is or not a polyomino corresponding to given pro- 
jections [2,3,5]. The best existing algorithm is the one of Chrobak and Diirr [5]. 
This algorithm determines, for all couples of rows k and /, if there is a hv-convex 
polyomino anchored in k and /, with given horizontal and vertical projections. 
It is done by translating the existence of hv-convex polyominoes anchored in k 
and I as a 2SAT problem (conjunction of disjunctive clauses with at most two 
boolean variables), whose size is in 0(mn). A 2SAT problem may be solved lin- 
early in its size [1] and the number of possible choices for k and I is min(m^, n^), 
as roles of rows and columns may be swapped. Therefore, it gives a complexity in 
0{mn min(m^, n^)) for the reconstruction of a hv-convex polyomino with given 
vertical and horizontal projections. In fact, we may limit choices of k and /, and 
then obtain an even more efficient implementation [5]. 

If we add to the algorithm of Chrobak and Diirr a convexity test, by us- 
ing any method presented above, the complexity of the algorithm remains in 
0{mn min(m^, n^)). On the other hand, if we want to decide if there is a convex 
polyomino with given projections by first seeking the hv-convex polyominoes 
having these projections, we shall have at worst an exponential complexity, be- 
cause we can have an exponential number of hv-convex solutions [8] . 

We also have started to seek for the convex polyominoes with given orthog- 
onal projections by using the paradigm of constraint programming. Let us give 
a short description of it: 

1. A constraint program is the input 

— of a priori domains in which variables of the problem are authorised to 
take their values, 

— of constraints to which these variables are submitted, 

— and of a strategy of assigning these variables. 

2. The running of a constraint program consists in the repetition until a solution 
emerges, or as long as possibilities exist, of the following iteration: 

— select, as a function of the prescribed strategy, a variable to assign, and 
an assignment value, among those in the domain of the variable which 
satisfy the constraints, 

— update all constraints related to the variable which has just been as- 
signed. 

This iteration is associated, in case of a dead end (none of the variables 
remaining for assignment may be assigned), with a backtracking process, 
back to the last choice point. 

This approach had already been used for the detection of the existence of hv- 
convex polyominoes with given orthogonal projections [20]. We have [17] mod- 
ified the program written at this occasion, by adequately enriching the set of 
constraints. 
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This new program has successfully run on a first sample of data. Moreover, we 
have observed the following: if we just take care of how to express the constraints, 
the adopted process allows us to take them into account globally, rather than 
successively. So, we avoid to apply first constraints related to the hv-convexity, 
and then the ones related to the convexity itself. This global treatment allows us 
to obtain convex polyominoes more quickly, when they exist, without having to 
enumerate previously the hv-convex polyominoes, which may be more numerous. 
And in the same way, it is possible to decide more rapidly that none of them 
exists, when it is the case. 

These first results are encouraging and push us to deepen this approach. 

6 Conclusion 

Our work has permitted a step forward in two directions. 

On the one hand, we obtain a characterisation of lower and upper convex 
hulls of a discrete line segment based on the arithmetic interpretation of these 
objects. 

On the other hand, we obtain a linear algorithm to test the convexity of a 
polyomino, which is very simple to express. We prove the correctness of this 
algorithm. After this paper was prepared, the authors’ attention was called by a 
reviewer to Hiibler, Klette and VoB’ paper [10]. We will compare their approach 
and ours in our oral presentation. 

Moreover, we have explored an alternative approach of the convexity test, 
in particular by using the methods of constraint programming. We have also 
applied the convexity test to some problems of discrete tomography. In both 
cases, the first results obtained urge us to continue on this way. 
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Abstract. We consider the problem of finding the best match for a given 
query shape among candidate shapes stored in a shape base. This is cen- 
tral to a wide range of applications, such as, digital libraries, digital film 
databases, environmental sciences, and satellite image repositories. We 
present an efficient matching algorithm built around a novel similarity 
criterion and based on shape normalization about the shape’s diameter, 
which reduces the effects of noise or limited accuracy during the shape 
extraction procedure. Our matching algorithm works by gradually “fat- 
tening” the query shape until the best match is discovered. The algorithm 
exhibits poly-logarithmic time behavior assuming uniform distribution of 
the shape vertices in the locus of their normalized positions. 

Keywords: image retrieval, shape-based matching, image bases 



1 Introduction 

The last few years, there is an emerging need to organize and efficiently use 
large pools of images that have been collected over the last decades and contain 
information potentially useful to areas such as medicine, journalism, weather 
prediction, environmental sciences, art, fashion and industry. It is estimated 
that there are more than 20 million pages containing hundreds of millions of 
images on world wide web pages alone [7]. Traditionally, images were retrieved 
by their filename, other technical characteristics such as date and size or through 
text keywords, in the case of manually annotated images. Manual annotation, 
except for being a time consuming and not real-time process, can describe only 
a very small percentage of the information that an image contains. 

Recently, there is an increasing effort to organize and retrieve images by con- 
tent based on characteristics such as color, texture, and shape. A number of 
methods in the literature perform indexing and retrieval based on global image 
characteristics such as color, texture, layout, or their combinations. QBIC [10,15], 
a system developed at IBM Almaden supports retrieval by color histograms, tex- 
ture samples (based on coarseness, contrast and directionality), and shape. QBIC 
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Technology) Bilateral Research Cooperation Grant between Greece and the Czech 
Republic 
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uses i?* trees to process queries based on low- dimensionality features, such as, 
average color and texture. Shape matching is supported using either dimension- 
ality reduction, which is sensitive to rotation, translation and scaling [17], or 
by clustering using nonlinear elastic matching [9,4], which requires a significant 
amount of work per shape and some derived starting points as a matching guide. 
QBIC also supports video queries. 

Ankerst et al [1] present a pixel-based shape similarity retrieval method that 
allows only minor rotation and translation. Their similarity criterion assumes 
a very high dimension (linear to the number of pixels in the image), therefore 
dimensionality reduction is performed. 

Gary and Mehrotra [16,12,11] present a shape-based method, which stores 
each shape multiple times. More specifically, the shape is positioned by nor- 
malizing each of its edges. The space requirements of this method impose a 
significant overhead. The method is quite susceptible to noise, thus the authors 
present a sophisticated preprocessing phase to eliminate the noise effects. Fi- 
nally, the method favors those shapes of the shape base, which have almost the 
same number of vertices as the query shape. 

Hierarchical chamfer matching (see [6] for hierarchical chamfer matching 
and [3,5] for chamfer matching) creates a distance image using information from 
the edges, and then tries to minimize the root mean square average of the val- 
ues in the distance map that a contour hit. Hierarchical chamfer matching gives 
quite accurate results rather insensitive to random noise, but involves lengthy 
computations on every extracted contour per query. In the hierarchical version 
a resolution pyramid is used to improve the performance of the matching algo- 
rithm. 

In this work, we present a shape-based method, where information regarding 
the boundary of objects is automatically extracted and organized to support a 
poly-logarithmic (in the number of shape vertices) algorithm based on a novel 
similarity criterion. Specifically, this paper makes the following technical contri- 
butions: 

- introduces a new similarity criterion for shapes, which works better in the 
context of image retrieval than traditional similarity criteria; 

- describes a novel way of storing shapes which is tolerant to distortion; 

- presents an efficient algorithm for finding the closest match to a given query 
shape; its time complexity is poly-logarithmic in the number of vertices of 
the shape base assuming uniform distribution of the vertices in the locus of 
their possible locations; 

- the algorithm can be easily extended to retrieve the k best matches instead 
of the single best match. 

The rest of this paper is organized as follows. Section 2 presents our similarity 
criterion for shapes and compares it with existing criteria. Section 3 describes 
the organization of the data describing the shapes, and the algorithm to retrieve 
the best match of a query shape. Section 4 presents some experimental results, 
while Section 5 concludes the paper and discusses future work. 
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2 Similarity Criteria 

The Hausdorff distance is a well studied similarity measure between two point 
sets A and B. The directed Hausdorff distance h and the Hausdorff distance H 
are defined as follows: 

h{A, B) = maj^aeA min^^^ d(a^b)^ H {A^ B) = max{h{A, B) , h{B , A)) , 
where d is a point-wise measure, such as, the Euclidean distance. An inherent 
problem with the Hausdorff distance is that a point in A that is farthest from any 
point in B dominates the distance. To overcome this problem, Huttenlocher and 
Rucklidge have defined a generalized diserete Hausdorff distanee (see e.g. [14]), 
given by the k-th largest distance rather than the maximum: 
hk{A,B) =kthaeA min^^B d{aff), Hk{A,B) = max{h{A, B) , h{B , A)) . 

This metric eliminates somehow the farthest-point domination disadvantage 
of the Hausdorff metric, but works only for a finite set of points (it is mainly 
used for A; = m/2 where m is the size of the point set). The generalized Hausdorff 
distance does not obey the metric properties. 

An interesting alternative measure, called nonlinear elastic matching^ is pre- 
sented in [9]. This measure does not obey the traditional metric properties but 
a relaxed set of metric properties instead. In practice, this provides the same 
advantages as any metric, and therefore can be used for clustering. However, the 
arbitrary number of points distributed on the edges, the need of determining 
certain starting matching points and the complexity of computing such a match 
(p(mn) using dynamic programming [2]) makes this measure inappropriate for 
very large data sets. 

In our algorithm we use a new similarity criterion based on the average of 
minimum point distances: 



havg{A,B) = average^^^ min^^B d{a,b) 

This measure behaves nicely (gives intuitive results) and it can be computed 
quite efficiently, as is shown in the next Section. The metric properties do not 
hold for this measure either, but in some sense they hold for a representative 
average set of points probably different from the original point set. Figure 1 
(left) illustrates an example where, using Hausdorff distance, the shape Q is 
matched with A instead of B {B is intuitively the closest match). According to 
the similarity measure used in our work, B is indeed closer to Q than A. 

3 Efficient Retrieval of Similar Shapes 

The algorithm is based on two key ideas: normalizing a shape about its diameter 
and the notion of the e-envelope. 

Normalizing about the diameter. In order to match a query shape to 
the shapes in the database, some kind of “normalization” is applied so that the 
matching is translation-, rotation-, and scaling-independent. In [12], Mehrotra 
and Gary normalize each shape about each of its edges: they translate, rotate, 
and scale the shape so that the edge is positioned at ((0, 0), (1, 0)). Although this 
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Fig. 1. (left) Depending on the similarity criterion, the query shape Q may be 
matched with A or B; (right) the e-envelope 

approach gives good results in many cases, it would fail to retrieve the distorted 
shape on the right of Figure 2, if the shape on the left of the figure was used as 
the query shape. In our retrieval system, instead of normalizing about the edges, 
we normalize about the diameter of the shape, i.e., by translating, rotating, and 
scaling so that the pair of shape vertices that are farthest apart are positioned 
at (0,0) and (1,0). This ensures better results, because the diameter is less 
susceptible to local distortion (like the one shown in Figure 2), which is very 
common in shapes extracted via automated image processing techniques. 





Fig. 2. (a) the query shape; (b) a distorted shape extracted from an image 

The e-envelope. The algorithm works by considering a “fattened” version of 
the query shape which is computed by taking lines parallel to the query shape 
edges at some distance e on either side (Figure 1 (right)); we call this fattened 
shape the e- envelope. The good matches are expected to fall inside or at least 
have most of their vertices inside the e-envelope even for small e. Therefore, if 
we start by using a small initial value of e and keep increasing it, we expect to 
collect the good matches after a few iterations of this procedure. 

The e-envelope can be seen as a collection of trapezoids of height 2e, one for 
each edge of the query shape. (For simplicity, we assume that e is such that no two 
trapezoids are overlapping; the method can be extended to handle overlapping 
trapezoids.) 
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3.1 Populating the Shape Database 

Populating the database of shapes is done by processing each available shape, 
a polygon or polyline extracted from an image, as follows. First, we compute 
the diameter of the shape, i.e., the pair of vertices that exhibit the longest 
Euclidean distance. In order to achieve even better tolerance to distortion, we 
will not simply normalize the shape about its diameter, as we alluded earlier; 
instead, we will normalize it about all its a-diameters, i.e., all pairs of vertices 
whose distance is at least 1 — a times the length of the diameter (0 < < 1). For 

each Q!-diameter, we scale, rotate, and translate the shape so that the a-diameter 
is positioned at ((0, 0), (1, 0)); each shape is stored twice for each a-diameter by 
taking both ways to match the two vertices defining the <a-diameter to the points 
(0,0) and (1,0). All these “normalized” copies of the shape constitute the shape 
base, the database of shapes. 

Of course, a shape with s vertices may have i?(n) o;-diameters, which would 
effectively result in an 0(n^)-size database to store shapes of 0{n) total size. 
However, this happens to fairly regular shapes; shapes extracted via automated 
image processing techniques are unlikely to be regular. In fact, experiments have 
indicated that for a = 0.15 the number of copies of each shape is about 12 on 
the average, including the doubling due to the double storage of a shape for a 
given (a-diameter (the average number of edges per shape in the test set was 20). 



3.2 Outline of the Matching Algorithm 

The algorithm works by considering e-envelopes of the query shape for (appro- 
priately) increasing values of e; for each such e, the polygons that have most of 
their vertices inside the e-envelope are determined and for each of them the value 
of the similarity measure to the query shape is computed. The algorithm stops 
whenever the best match has been found, or e has grown “too large” implying 
that no good matches exist in the shape base. In the latter case, we revert to an 
alternative but compatible geometric hashing method which is outlined in the 
technical report version, due to space limitations. 

In more detail, the basic steps of the algorithm for the retrieval of the 
database shape that best matches the query shape are: 

1. We compute an initial value ci = Cg such that the ei -envelope is likely to 
contain at least one shape of the shape base (see Section 3.3). We set cq = 0 
and we signal that we are in the first iteration by assigning i = 1 . 

2. We collect the vertices of the database shapes that fall in the difference (e^- 
envelope — e^_i -envelope); this can be achieved by partitioning this differ- 
ence into triangles and preprocessing the vertices so that inclusion in a query 
triangle can be answered fast (simplex range searching). (If no vertices are 
found then the difference — e^_i is increased geometrically.) Additionally, 
each time we find that a vertex of some shape is inside the above envelope 
difference, we increase a counter associated with that shape that holds the 
number of its vertices that are inside the e^-envelope. 
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3. If no shape of the shape base has at least a fraction 1 — f3 of its vertices 
inside the e^-envelope (for a parameter (3 such that 0 < /3 < 1), a new larger 
e is computed (Section 3.5) and we go to step 5. 



of their vertices inside the e^-envelope (these are the candidate shapes)^ we 
process them as described in Section 3.4. During the processing, we may 
either conclude that the best match has been found, in which case it is 
reported to the user and the execution is complete, or a new larger value of 
e is computed. 

5. We increment i and set = e. If does not exceed 2 piQ we go to 

step 2 and repeat the procedure {A is the area of the locus of the normalized 
shapes (Section 3.3), p is the number of shapes in the shape base, n is the 
total number of vertices of the p shapes, and Iq is the length of the perimeter 
of Q); otherwise, we report the best match so far (if any) and exit. If no match 
has been found, we employ geometric hashing. 

The method converges and if there exist similar shapes it retrieves the best 
match. 

3.3 Computing the Initial Width 2eg of the e-Envelope 

We hrst compute an initial estimate e of the width of the e-envelope based on 
an estimate of the number of vertices that fall inside the envelope. Then, we 
calculate the actual number of vertices of the database shapes. If is at 
least half and no more than twice we set = e; otherwise, we adjust e by 
performing binary search in the values of K^. 

The computation of e is done as follows. Let us compute the area A of the 
locus of the vertices of the normalized shapes. If the shapes were normalized 
with respect to their diameter only, then all the vertices would fall in the lune 
defined by two circles of radius 1 whose centers are at distance 1 apart. Since we 
store copies of each shape normalized for all pairs of vertices whose distance is 
at least 1 — a times the length of the diameter (0 < < 1), the area A is equal 

to the area of the lune defined by two circles of radius whose centers are at 



By assuming uniform distribution of the vertices inside this lune, the average 
number of vertices inside an e-envelope around the query shape Q is estimated 
to: 



and the total number of vertices of all the shapes of the shape base, respectively. 

In order that the initial e-envelope contains at least enough vertices for a 
candidate shape (at least a fraction 1 — j3 of its vertices lie inside the envelope), 
we derive that: 

Ke > {I — [3)^ where p is the total number of shapes in the shape base. 

The above estimate may yield the necessary number of vertices, but the 
probability that all of them belong to the same shape is very small. So, we 



4. If there are shapes of the shape base that have at least a fraction 1 — f3 



distance 1 apart. This implies that A = cos ^(^^) 




Ke = where Iq and n are the length of the perimeter of the query shape Q 
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determine experimentally a 7(n), and set = (1 ~ 7(^) which implies that 

Ki: = ki: = {l- e=i|^^7(n). 

Through experimentation, we have determined that a good choice for 7(71) 
for relatively small shape bases (see Section 4) is: 7(71) = 51og7r. 

3.4 Processing the Shapes 

We first process all the new candidate shapes, that is, the shapes that have more 
than a fraction 1 — (3 oi their vertices inside the current e-envelope but did not 
do so in the previous envelopes. For each such shape Pj, we compute the value 
of the similarity criterion of Pj with respect to the query shape Q, which we will 
call the cost Cj of Pj : 

laeP d{a,b) 

Cj = miiib^Q d{a,b) = length (Pj) ’ where by length{P) we 

denote the length of the perimeter of P. The computation is done by intersecting 
each edge of Pj with the Voronoi diagram of Q; the boundary of Pj is thus split 
into segments that are close to either a vertex or an edge of Q. The contribution 
of each such segment s in min^^g d(a, b) can then be easily computed: 

— if 5 is in the Voronoi region of an edge e of Q and s does not cross e, then 

the contribution of s is equal to c{s) = length{s), where di and t/2 are 

the distances of the endpoints of 5 from e; 

— if 5 is in the Voronoi region of an edge e of Q and s crosses e, then the 

contribution of 5 is equal to c{s) = 2 {d^-kd 2 ) where di and t/2 are 

the distances of the endpoints of s from e; 

— if 5 is in the Voronoi region of a vertex u of Q, then the contribution c(s) is 
given by a more complicated expression, which (of course) only depends on 
the coordinates of the endpoints of s and of the vertex v. 

W c(s) 

Then, Cj = than the cost Cmax of the best match so far, 

then Pj becomes the current best match and Cmax is set equal to Cj. 

Next, we process all the shapes that are not yet candidates (and have at 
least a vertex other than (0,0) and (1,0) in the current e-envelope), in order 
to determine whether we have found the best matches, and if not to produce a 
new larger e for the e-envelope. So, for each of these shapes, say, we compute 
the contribution min^^g <i(a, b) of each of its edges e that has at least one 
endpoint inside the e-envelope. Let the sum of all these contributions be tj and 
let the total length of all these edges be dj. We check whether /en^A(5 ) 
^{length(Sj) — dj)) > Cmax- If yes, the cost of Sj will not be less than Cmax] 
this is so, because the edges of Sj with at least one endpoint in the current 
e-envelope contribute tj in min^^g d{a^b), whereas the remaining edges 

will contribute more than e/2 times their length (each edge has both endpoints 
at distance larger than e away from Q). So, if the above inequality holds, we 
ignore Sj from now on. Otherwise, we compute, Cj = 1 ^ which 
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turns the previous inequality into equality. Note that ej is larger than the current 
width e of the envelope. 

After all the SjS have been processed, we consider the set of collected e^s. If 
the set is empty, then we have found the best match and we stop. Otherwise, 
we select the smallest element of the set and we use it as the new width e of the 
envelope. 

3.5 Increasing e in the Absense of Candidate Shapes 

In this case, all the shapes of our shape base have less than a fraction 1 — j3 of 
their vertices inside the current e^-envelope. Then for each of the shapes that 
have a vertex other than (0, 0) and (1, 0) in the envelope, we do the following. 




Fig. 3. Distribution of the vertices inside the upper part of the lune; the lower 
part is symmetric 



Let Pj be such a shape with Uj vertices and let Vj be the set of its vertices 
that are inside the envelope. Consider the set Vj of vertices of Pj that are 
adjacent to vertices in Vj; for each vertex Vk in VJ — Vj, we compute the shortest 
distance dQ(vk) of Vk from the query shape Q. If the total number of vertices 
in Vj U V' exceeds (1 — (3)nj (i.e., there are more than a fraction 1 — (3 of Pj^s 
vertices in Vj U VJ), then we set e' equal to the ((1 — f3)nj — |V^ |)-th smallest 
distance dQ{vk); this implies that Pj will be a candidate shape in the e' -envelope. 
In the case that the total number of vertices in VjUVJ does not exceed {l—f3)rij, 

then we set e' equal to (i.e., we use linear interpolation in order to 

estimate the width of the envelope for which Pj will be a candidate shape). 

After all these e' s have been computed, we collect the smallest among them 
and use it as the new e of the envelope. 

3.6 Time Complexity of the Matching Algorithm 

Before analyzing the time complexities of each of the steps of the algorithm, 
we recall that in order to compute the similarity measure, we make use of the 
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Fig. 4. A best match within the initial guess of the envelope 



Voronoi diagram of the query shape Q. This can be computed in O(mlogm) 
time, where m is the size of Q. 

Step 1 of the algorithm begins with the computation of e which takes 0{1) 
time. Then, the number of vertices that fall inside the e-envelope is computed; 
this can be done in O(poly-logn) time using simplex range counting algorithms 
and quadratic or near-quadratic space data structures. If the computed number 
greatly differs from the expected number, O(logn) repetitions of the previous 
computation are done, resulting in O(poly-logn) total time for this step. 

In step 2, we need to compute the vertices of the shapes in our database 
that fall in the difference of the e^-envelope — e^_i-envelope (this ensures that a 
vertex will not be processed or counted multiple times) . The difference of the m 
trapezoids (one for each of the m edges of the query shape) can be decomposed 
into 0{m) triangles which can be used with simplex range reporting data struc- 
tures of near-quadratic space complexity that take 0(log^ n + ^) time per query 
triangle, where n is the total number of vertices of the shape base and ^ is the 
number of vertices that fall in the triangle [13]. (There are also quadratic-size 
data structures that allow for 0(logn + k.) query time by employing fractional 
cascading [8].) Thus completing the i-th iteration of step 2 takes 0{m log^ n-\-Ki) 
time in total, where Ki is the number of vertices in all the query triangles for 
that iteration. 

The z-th iteration of step 3 takes 0{mKi) time, where Ki is again the number 
of vertices between the e^-envelope and the e^-i -envelope. 

Step 4 involves processing the new candidate shapes and the non-candidate 
shapes. The former takes time 0{m\Pi\), where \Pi\ denotes the number of ver- 



514 



loannis Fudos and Leonidas Palios 



tices of Pi. Processing the non-candidate shapes can be performed in 0{lPmKi) 
time, by maintaining the contributions of vertices in previous envelopes and sim- 
ply adding the contributions of vertices between the e^-envelope and the e^_i- 
envelope. Step 5 takes constant time. 

The overall time complexity after r iterations is therefore, 

0{m logm) -1- O(poly-logn) -1- Yll=i 0{poly-logn + mKi) = 

0{m logm) 0(r poly-logn) H- 0{mK) 

where K is the total number of vertices processed and since the total number 
of candidate shapes is 0{K). This is 0(r poly-logn + K) since the size m of the 
query shape is constant. Finally, by assuming uniform distribution of the vertices 
inside the lune and in light of the test for in step 5, the number K of vertices 
and the number r of iterations is expected to be poly- logarithmic in n, and 
therefore the total time complexity is poly- logarithmic in n. 

4 Experimental Results 

The algorithm has been implemented in C and the user interface has been devel- 
oped using Tcl/Tk. We currently have a stable version running on a Sun Solaris 
platform. The software is easily portable to various platforms. The user is first 
presented with a workspace where she/he can draft a query sketch, which is 
subsequently presented first to our matching algorithm and in case of failure to 
the geometric hashing approach. 

We have performed experiments with around 100 images and 350 actual 
shapes. Each shape is stored on the average approximately 12 times resulting 
in a shape base populated with 3000 normalized shapes. The total number of 
vertices was n = 30000. The distribution of the vertices was not uniform because 
of the specialized nature of the images (see Figure 3). 

In the experiments we used a = 0.15, = 0.15 and y{n) = 5 log 2 n. The 

initial estimation of was usually very close to the actual number of vertices 
that fell in the envelope. In the case of Figure 4, we started with an initial 
estimation of 244 for which gave an ei = 0.0030 with an actual 180 vertices 
inside the envelope. In this case we obtain immediately a best match with cost 
c = 0.0013. Similarly in the case of Figure 5, we find a best match with the 
first iteration with slightly larger cost c = 0.0792 since two edges of the matched 
shapes are partially outside the envelope. Finally, in Figure 6 we find a best 
match after a second refinement iteration; in this case, the returned shape for 
a triangular query shape is a polygon with 10 vertices that has a cost around 
c = 0.0008. Even though the vertices are not uniformly distributed in the lune, 
the algorithm behaves as expected in terms of number of iterations and time 
complexity. 

5 Conclusions and Future Work 

We have presented an efficient noise tolerant shape-based approach to image 
retrieval. The algorithm currently yields the best match, but it can be easily 
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Fig. 5. Querying with an orthogonal shape; the door of the church is the best 
match 



extended the k best matches; in this case, a heap of size k is used to hold the k 
current best candidates. 

We are currently incorporating this algorithm in a video retrieval system, 
which will allow us to experiment with larger shape bases. Other future research 
directions include finding alternative ways to do the range searching (whose 
space requirement is high), ensuring robust calculations, adding 3D awareness 
support, and using relative position information for allowing more complicated 
queries such as containment and tangency. 
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5. G. Borgefors. An improved version of the chamfer matching algorithm. In 
ICPR1984, pages 1175-1177, 1984. 506 

6. G. Borgefors. Hierarchical chamfer matching: A parametric edge matching al- 
gorithm. IEEE Transactions on Pattern Analysis and Machine Intelligence^ 
10(6):849-865, 1988. 506 

7. G. Garson and V. E. Ogle. Storage and retrieval of feature data for a very large 
online image collection. IEEE Bulletin of the Tech. Comm, on Data Engineering^ 
19(4):19-27, 1996. 505 

8. B. Ghazelle and L. J. Guibas. Fractional cascading: I. a data structuring technique; 
II. applications. Algorithmica^ 1:133-191, 1986. 513 

9. Ronald Fagin and Larry Stockmeyer. Relaxing the triangle inequality in pattern 
matching. International Journal of Computer Vision^ to appear, 1999. 506, 507 

10. M. Flickner, H. Sawhney, W. Niblack, J. Ashley, Q. Huang, B. Dom, M. Gorkani, 
J. Hafner, D. Lee, D. Petkovic, D. Steele, and P. Yanker. QBIG: Query by image 
and video content. IEEE Computer^ 28(9):23-32, 1995. 505 

11. J. E. Gary and R. Mehrotra. Similar shape retrieval using a structural feature 
index. Information Systems^ 18(7):527-537, 1993. 506 

12. J. E. Gary and R. Mehrotra. Feature-index-based similar shape retrieval. In 
S. Spaccapietra and R. Jain, editors. Visual Database Systems, volume 3, pages 
46-65, 1995. 506, 507 

13. J. E. Goodman and J. O’Rourke. Handbook of Discrete and Computational Ceom- 
etry. GRG Press LLG, 1997. 513 

14. D. P. Huttenlocher and W. J. Rucklidge. A multi-resolution technique for com- 
paring images using the hausdorff distance. Technical Report TR92-1321, GS 
Department, Gornell University, 1992. 507 

15. IBM. Ibm’s query by image content (QBIG) homepage, 
http: // wwwqbic.almaden.ibm.com. 505 



An Efficient Shape-Based Approach to Image Retrieval 517 



16. R. Mehrotra and J. E. Gary. Similar-shape retrieval in shape data management. 
IEEE Computer, 28(9):57-62, 1995. 506 

17. W. Niblack, R. Barber, W. Equitz, M. Elickner, E. Glassman, D. Petkovic, and 
P. Yanker. The QBIG project: querying images by content using color, texture 
and shape. In Proc. SPIE Conference on Storage Retrieval for Image and Video 
Databases, volume 1908, pages 173-181. SPIE, 1993. 506 



Towards Feature Fusion — The Synthesis of 
Contour Sections Distinguishing Contours from 

Different Classes 



Dag Pechtel and Klaus-Dieter Kuhnert 



University of Siegen 
57068 Siegen, Germany 

{pechtel , kuhnert}@pd . et-inf . uni-siegen . de 
http : / / WWW . pd . et-inf . uni-siegen . de 



Abstract. In real world problems, where the objects are in general com- 
plex and deformed, the automatic generation of local characteristics is 
necessary in order to distinguish different object classes. 

This paper presents an approach towards the automatic synthesis of sig- 
nificant local contour sections of closed, discrete, complex, and deformed 
2D-object contours for the distinction of different classes. Neighboring 
contour points are determined and synthesized (feature fusion) into fea- 
ture groups. Exclusively with the help of these feature groups the method 
distinguishes between different 2D-object contour classes of a certain do- 
main. The basic idea is to get only the necessary information of a contour 
or a contour class for recognition. 



1 Introduction 

A basic problem in the field of pattern recognition is the automatic synthesis of 
elementary features to higher level features (feature fusion) , that can distinguish 
between certain pattern classes of a task domain. 

This paper presents an approach for closed discrete 2D-object contours, 
whose base features, the contour points, are nearly automatically fused to lo- 
cal and significant feature groups, called the significant contour sections. These 
significant contour sections distinguish a contour from the contours of other con- 
tour classes in a task domain. E.g. a significant contour section of a bottle could 
be its bottle neck and significant contour sections of a fish could be its fins. The 
method even works with contours that are complex and/or badly deformed. 

There are many approaches in literature concerned with the analysis of con- 
tour similarities [L97]. There are alignment approaches [U96], the use of invariant 
properties (basics in [PM47]) that is often a global approach, and the use of parts 
([HS97], [LL98b], [KTZ94], [SIK95], [STK95]) that is sometimes based on shape 
simplification [LL98a], or structural descriptions ([KW96], [P77]). Some works 
deal with deformation and minimize a cost function: [BCGJ98] define ’elastic en- 
ergy’ needed to transform one contour to the other. [SGWM93] is concerned with 
shape interpolation, also known as morphing, or the sequence of contour points 
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is interpreted as a string in which points are substituted (shifted), inserted, and 
deleted ([PKOO], [SK83]). This work is based on [PKOO]. It uses shares of the 
alignment approach and isn’t psychophysically motivated. This work is concen- 
trated on the nearly automatic synthesis of local contour sections for recognition 
and is a step towards the autonomous creation of abstract local characteristics 
of a contour class consisting of a variety of similar, but not equal objects (eg. 
the birds in fig. 8). Besides the variety, we also interpret outer influences (eg. 
occlusion, bending) to an object, autonomous motions (eg. stretching out an 
arm) of an object, and different projections of an object all as deformations. 

Basic notations used in this article are presented in section 2. In section 3 the 
determination of important contour sections, which distinguish between two dif- 
ferent 2D-object contours, is briefly sketched. Section 4 shows how the important 
contour sections of one contour are combined to significant contour sections of 
that contour. In section 5 experimental results are presented based on a data set 
of deformed contours of reusable material collections (plastic bottles, beverage 
cartons (fig. 7) and a data set of birds, fishes, and plastic bottles (fig. 8). With 
a summary in section 6 this paper concludes. 

2 Notation 

The task domain D consists of N classes Kf. 

7^ = {iFi,...,iT„...,iFw}. (1) 

Each class Ki consists of rrii closed, discrete 2D-object contours Cij: 

iF, = {C,,i,..., Q, ,,..., (2) 



Each contour Cij consists of riij neighboring successive points Pij^k with equal 
Euclidean distances d. Each contour consists also of riij sections '• 



Ci,j — {Pi,j,l^ • • 


• 1 Pi,j,k •) • • 


• 7 Pi,j,riij } 


( 3 ) 


r 

’ • • • 




'i 


( 4 ) 


Each section consists of {‘2lij^k 


+ 1), h,j,k 


> 1 points Pij,k‘ 






• • • 7 Pi,j,k-) 


• • • 7 Pi,j,k+lij^k }• 


( 5 ) 



Therefore, neighboring sections have at least 2 common points Einally 

each point Pij,k consists of a x- and a y-coordinate: 



Pi,j,k — 5 (b) 

Each section has an orientation vector Oij^k- 



( 7 ) 
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Fig. 1. Example of a closed discrete contour Ci^j with one of ni^j sections 
(marked bold; = 2) and the corresponding orientation vector Succes- 

sive points have all equal Euclidean distances d 



An example of such a discrete contour is shown in fig. 1 . 

Two contours are distinguished with the following indices: Ci^j and Two 
points or sections are distinguished as follows: Pij^k,Pr,s,t and , 

respectively. 

A local comparison of two sections results in a local similarity j 5 ^^): 



/i: 



A 



{i,j,k),{r,s,t) 



p\^i,j,k ' 



C " lr,s,t \ 

r,s,t ) 



(8) 



and is entered in the kth row and in the tth column of a local similarity matrix 






= A 



(i,j,k),{r,s,t) 



(9) 



which results from a local similarity analysis of all possible section pairs of two 
contours Cij and Cr,s' 



h- =f2{Cij,Cr,s)- ( 10 ) 

The point indices have to be calculated modulo, because the contours are closed. 

3 Determination of Important Contour Sections 

In [PKOO] a method is presented for the determination of important contour 
sections that distinguish between 2 different contours. That method is based 
on a local similarity analysis of all possible section pairs with a 

constant number of points {2lij^k + 1 ) = + 1 ) = const and is shortly 

recapitulated here in a little changed form. 

In fig. 2 is shown an example for the local comparison of two sections CP 
g f with lij^k = lr,s,t = 2. Eirst, the sections were translated, that their start- 
ing points Pij^k-2^ Pr,s,t-2 coincided with the origin of the Carthesian coordinate 
system. Second, the sections were rotated around the origin that their orienta- 
tion vectors Ozy,/c, coincide with the positive x-axis. And third, section g ^ 
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X 



Fig. 2. Example for the comparison of two local contour sections j ^r,s,t 
with hj^k = lr,s,t = 2 after translation, rotation (orientation vectors Oij^k^ Or,s,t 
coincide with the positive x), and minimizing the sum of Euclidean distances 
of relating contour points. The 5 relating contour points are connected with 5 
straight lines 



was translated along the x-axis, that the sum of squared Euclidean distances of 
relating contour points, eg. starting points or ending points of the sections, is 
minimized. The 5 relating contour points are connected with 5 straight lines in 
fig. 2. 

Here, we search for the maximum number of points hj,k = W,s,t of tho sec- 
tion pair around the points Pij^k, Pr,s,t^ that the local distance 

(dissimilarity) e < E. E is the given maximum permitted distance (dissimilar- 
ity) between 2 sections, e is calculated with an approximate one-to-one point 
matching ([PKOO]; a survey can be found in [AG96]). 

With this definitions and the notation in section 2, the steps for the calcula- 
tion of the are combined in the algorithm shown in fig. 3. This results 

in a rii^j x rir^s ~ matrix of local similarities. The elements 

give the maximum length of the sections and around the 

points Pij^k and Pr,s,tj with respect to a given maximum dissimilarity E. In 
other words, App'^^'^^ gives the osculating length of two contour sections that 
can be treated equal in respect to this fixed maximum error E. 

In order to get the transformation that possesses minimum cost for the defor- 
mation from Ci^j to Cr^s exactly one monotone discrete path (list of neighbored 
elements) is searched in Eor that, the cost function 



z 

.V'i/) G {1,2,..., ^max (11) 

;2 = 1 

with max{riij ^Tir^s) ^ Z < {riij -h rir^s) is maximized with the help of dy- 
namic programming technique (e.g. [B57], [SK83]). Because of the existence of 
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for k — 0 . . . riij 
for t = 0 . . . rir,s 

while e < E 



^ A\^f 3 r,s) ^ ^ 

k,t 






’ ’ k,t 


• • 5 Pr,s,t 




II Calculate the translation of C^jk 


and into the origin 



//of the Cartesian coordinate system 



c/ = { 0 , . 



, Pi.i.k — p. 



i,j,k — z,j,fc + a 



(i,j),(r,s) 



-P. 



i,j,k — A 



k,t 






c: 



= {0, . . . , Pr,s,t — P 






).■■■. P, 



k,t k,t 



(i,i),(r,s) } 



// Calculate the orientation vectors of and CJA 

Oi,j,k = Pij^k+A['f-<-'^-‘>'> ~ ^i,j,k-A<j^f’<-''’‘> 

Or,s,t = 

’ ’ k,t ’ ’ k,t 



rotate and C,'’’’"’* 



■ j ^ ci±±^ ^r,s,t II "that the orientation vectors Oij^k ana Or,s,t 



Oij^k and o, 

II coincide with the positive x-axis 
// resulting in and 

II Calculate the minimum sum of squared Euclidean distances e 
//of contour points belonging together (1-to-l match) 



+A 



{%,3),{r,s) 

k,t 






+ a ^ijjjk (2^r,s,t + a ~h 

~h(yi,j,fc + a ^r,s,t+a) ^ 



// with Xij^k^Xr^sA are the means of the 
// + 1) ^-coordinates Xij,fc+a, ^r,s,t+a 

end 

end 

end 

Fig. 3. Algorithm for calculating the elements of the local similarity 

matrix 2 contours Cij,Cr,s- Comments are marked with // 



E = min(nij , Tir^s) possible start elements for j^(i,j),(r,s) (rotation), 

that path is selected over all -0 causing the lowest cost (invariance against rota- 
tion). 

Sections of that contribute a little to C^(i,j),(r,s)5 the osculating 

length is short, map those contour sections of Cij and Cr,s onto each other, that 
are different. Thus, those contour sections are important contour sections I for 
the distinction of the contours Cij and Cr,s. 
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Fig. 4. Mapping of contours and € 2^3 (fig. 7) onto each other with lowest 
cost: a) Visualized similarity matrix of contours Ci ,5 (plastic bottle) 

and C 2,3 (beverage carton) with path (marked in black), b) Visualized 

contour points mapped onto each other with respect to in fig. 4a 



An example of such a path foi* the contours and C 2,3 (fig. 7) is 

shown in fig. 4. The local similarity matrix is shown in 

fig. 4a. Elements of yl(i’^)42,3) scaled between 0 and 255. Thus, yl(i’^) 42 , 3 ) 
be visualized as an image. Light (marked with a dotted square) and dark (marked 
with a dotted ellipse and belongs to the bottle neck) sections respectively in 
A(1’5)’(2’3) or on (marked in black) map contour sections with high 

and low local similarity respectively onto each other. starts at an 

element in the first row marked with a little circle and ends in the last row 
have to be always ’’closed” - imagine that the visualized yl(i’^)’(2,3) 
is bent to a torus). Corresponding circles are found in fig. 4b, too. FLere, the 
mapping of every 20th contour point is visualized with respect to contour Ci, 5 . 
For further details see [PKOO]. 

4 Synthesis of Significant Contour Sections 

With the method described in section 3 important contour sections I can be 
determined, that distinguish between two 2D-object contours. 

This section describes an approach how to generate significant contour sec- 
tions S for a fixed contour Ca,b, belonging to a fixed contour class Ka, from all 
important sections /, that came from the comparison with all contours Cr,s of 
the other classes r ^ A. 



524 Dag Pechtel and Klaus-Dieter Kuhnert 



First, all local similarity matrices ^ ^ r and all paths 

A r have to be calculated (see section 3). Those sections of Ca,b, which belong 
to elements z of satisfying the condition 

< 0 ( 12 ) 

are the important contour sections of Ca,B‘ O is constant for all I and is called 
importance value. Le., now there is a list of all I of the contour Ca,b with 
respect to all contours In general, this list includes both /, that overlap 
each other, i.e. they have also equal contour points, and /, that do not overlap 
each other, i.e. they do not have a single contour point in common. 

An example in principle is shown in fig. 5. The task domain D consists of 
3 classes. Class K\ consists of 3 rectangles, class K 2 consists of 2 ellipses, and 
class ATs consists of 1 triangle. In this example the fixed contour Ca,b = Ci, 2 - 
The important contour sections of Ci ,2 and the corresponding important contour 
sections of all contours of the other classes K 2 ^Ks are marked in bold. This 
results in the marked 4 groups of overlapping important contour sections. 

A reasonable assumption is that contours of the same class are more similar to 
themselves than contours of different classes. Another reasonable assumption is 
that contours of the same class are more similar to themselves because they have 
significant contour sections S. The last assumption is reasonable because human 
beings are able to recognize objects even if the objects are occluded. And there 
is another assumption: many overlapping important contour sections I with only 
a little variation in their position on the contour give a hint that this sections I 
are significant contour sections S of this contour with respect to the task do- 
main D. But which I is the representative one? There is no general answer to 
this question. If there are Qp overlapping important sections /q, 1 < ^ < Qp a 
better way is to answer the foilwing question: 

How are Qp overlapping Iq eomhined to 1 Sp? 

There are many possible approaches to do that. One possibility is to take the 
minimum starting point index = min{sq} and the maximum ending point 
index = max{eq} as starting and ending point indexes Sp^Ep of Sp. But 
this results in very long Sp. Good results were obtained with the calculation of 
the arithmetic means 5q, and the variances Sq, Cq both of the Qp starting 
point indexes Sq, and the Qp ending point indices Cq of all overlapping Iq. Then, 
this results in the following starting and ending point indices Sp^ Ep of the Spi 

Sp = round{sq — ^J^q) , Ep = round{cq + '\f^) A S: P ^ H. (13) 

n is the total number of Sp of a contour and round {. . .) means: round to the 
nearest integer. A simple example for the combination (fusion) of two Iq is shown 
in fig. 6. 

Since contour Ca,b could have several Sp^ another question arises: 
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Fig. 5. Example for overlapping and not overlapping important contour sections: 
3 classes consisting of 3 rectangles, 2 ellipses, and 1 triangle (iFi, 7^2, iTs) are 
the task domain. Here, contour C \^2 is the fixed contour. All important con- 
tour sections of C \^2 and all corresponding contour sections of the contours of 
the other classes K\^K 2 are marked in bold. This results in 4 different groups 
each consists of overlapping important contour sections (marked with different 
dotted ellipses), but no important contour sections of two different groups are 
overlapping each other 



On what depends the signifieanee of the generated Sp and what is a reasonable 
signifieanee measure dp ? 

In general the contour Ca,b have several significant sections Sp. It’s reason- 
able to limit the number of Sp. Because of equation 12 there are Sp that comes 
from not overlapping important sections Iq with path elements near 

O. These Sp have only a few points. Therefore a reasonable significance measure 
dp of Sp should have at least 3 dependencies: 

— The bigger the number Qp of overlapping 7^, the bigger the significance of Sp. 

— The smaller the variation 

Vp — Sq -|- Oq 



( 14 ) 
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Fig. 6. Example for the fusion of 2 of a sketched bottle Cip with respect to 
a class consisting of 2 triangles C2,i,C2,2- Important path sections are marked 
in black. Corresponding important contour sections I are marked bold on the 
contours 



of the starting and ending point indices of the Qp overlapping Iq^ the bigger 
the significance of Sp. 

— The bigger the mean of the sum of dissimilarities dp 



1 1 
Qr, ^ Wa 



with Wa 



q=l 



p(AB),(r,s) 

Z = Sq 



(15) 



between the Qp individual overlapping Iq and the Qp corresponding contour 
sections of the contours the bigger the significance. Wq is the similarity 
of two individual corresponding important contour sections of Ca,b and 



One possibility to combine this 3 named dependencies in a significance measure 
is as follows: 



gp = 1 ^ (16) 

L Vp 

that is abbreviated called significance ap of Sp. 7 are weightings for the 3 
dependencies. Now, with the equations (12)- (16) one Sp is completely described. 

With equation 16 it’s possible to limit the number of Sp on those Sp^ that 
satisfy the condition 



(jp>^(j^^^ ,0.0 <^<1.0. (17) 

^max _ jYi(ix{(jp} mcaus the maximum significance of all Sp of Ca,b- Those Sp, 
that are more significant than other Sp with respect to equation (17) are called 
more significant contour sections My, 1 <v <V and the Sp that is the one 
with the highest significance is called most significant contour section M. 



5 Experiments 

The presented method was tested with a task domain D\ consisting of 2D-object 
contours of packagings of reusable plastic material (fig. 7). The collecting process 
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Fig, 7* Contours of 0 plastic bottles (Ci j — Ci^e) and 6 beverage cartons (C 2 j ^ 
C 2 ^g) with their most significant contour xsections M marked in boid 



Ci.t Ci.T Ci^ C|4 C|^5 C],6 Cj^fi C|,9 Cjjo 

^ 0 . 

C2.1 ^X- ^r,T(j 

C34 C3,2 C3,3 0^4 Cjj C'3,6 C3.7 C 3 ,g C3.9 C 3 ,J 0 



Fig, S. Contours of 10 birds (Ci^i — Cijo), 10 bottles (C 2 J — C'2,io)! and 10 
fishes (Csj — C 3 . 10 ) with tlidr most significant contour sections M inarke^d in 
bold 



and the mechanical preparation in a sorting plant make tlie packagings strongly 
dirty and deformed. The method was also tested with a more complex task 
domain D> consisting of 3 classes: fishes, birds and plastic bottles (fig, 8). 

All contours are scaled so that their biggest generalized diameters are ap- 
proximately equal. The contours have from 400 up to 600 contour points and 
succeasive contour points have all ec[iial Euclidean distances of 1.2 [pixels, because 
it^s a compromise betwcxm 141 and \/5 {horizontal and diagonal pixel distance 
on a SCI u are pixel grid) . The experiments are carried out with heiiristically found 
parameters. In ijoth cases it seems to Ije good if the weigiitiiig of Qp is liigh in 
respect to dp^ Vp. 

The results of the ex]>eriments are shown in fig, 7 and fig. 8, The M of the 
contours are marked in bold. 

For the bottle class in the task domain D[ shown m fig, 7 the most significant 
contour sections M are the bottle necks. Here, it is possible to distinguish tliLs 
two classes only with the help of the l>ottle necks. In the more comjdex task 
domain D 2 the M of the bottles are not only the bottle necks. Also the sections 
around one corner are now significant to distinguLsh the bottles from the birds 
and the fishes. The most significant sections M of the Ijirds are in most Ccises 
the claws but there is also a beak. Interesting is bird C],io wliicii has no claws. 
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Here, a tail feather is the M. The fishes have their most significant sections at 
their fins. Because of the great variety of the fins the presented method find most 
significant fins at different positions on the fish contours. 

6 Conclusion 

A method was presented in this paper, that nearly automatically, supervised 
learns those contour sections My of a certain contour, with which this contour 
can be distinguished from contours out of other contour classes. We call this 
generation of feature groups feature fusion: Base features or feature primitives 
(contour points) are synthesized to feature groups (are fused to more significant 
contour sections My). Now, the fused more significant contour sections My can 
serve as knowledge base of a classifier for assigning an unknown contour to a 
certain contour class. 

Experimental results were made with two different task domains. The method 
extracted that the class of the plastic bottles can be distinguished from the 
class of the beverage cartons only with the help of the contour sections we call 
bottle necks. In the other task domain D2 the method extracted that the most 
significant sections M of the birds are the claws and the beaks, the M of the 
fishes are the fins, and the M of the bottles are the corners and the bottle necks. 
This seems a little like a human being would do. 

A further combination of the more or most significant contour sections M^, M 
of a certain contour class is desirable in order to create abstract contour 
sections A. E.g., combining all M at the position of the bottle necks into an 
A ’’BOTTLE NECK”. Thus, a problem still to be solved is the problem of 
abstraction for the single classes. First experiments with a hierarchical classifier 
based on the most significant sections M were made and give encouraging results. 
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Abstract. We use an interval graph to model the uncertainty of line 
slopes in a digital image. We propose two different algorithms that group 
lines into classes of lines that are parallel, or almost parallel. This group- 
ing is strongly based on the Helly-type property of parallelism in the 
digital plane: a group of lines is digitally parallel if and only if each pair 
of lines is digitally parallel. As a result, the extraction of parallel groups 
is reduced to the extraction of cliques in the interval graph generated 
by the slope intervals. Likewise, the extraction of lines that are almost 
parallel becomes equivalent to the detection of subgraphs that resemble 
cliques. 



1 Introduction 

In this paper we consider the graph-theoretical properties of parallel line group- 
ing in a digital image, mostly from the computational viewpoint. This work is 
part of an ongoing effort to define a sound mathematical basis for the extraction 
of geometric structure from an image while taking into account the uncertainty 
that is present. In previous work we already considered some basic geometric 
properties such as collinearity, parallelism and concurrency [8], and we studied 
some of the algorithmic aspects of the theory [9]. In this paper we propose al- 
gorithms that classify lines into groups of parallel lines, taking into account the 
uncertainty of edge positions and the fact that parallelism is not an equivalence 
relation in the digital plane. 

Any algorithm that extracts geometric structure must take into account the 
uncertainty on the position of the pixels. This uncertainty has several causes: the 
digitization process, the localization errors introduced by the edge detector, and 
the possible errors introduced by the line extraction algorithms, e.g., RANSAC or 
the Hough Transform. In fact, uncertainty can even be an artificial parameter, 
e.g., to detect automatically in an image vanishing points of the perspective 
projection, a first step is to extract groups of lines that “almost” parallel. 

In the approach proposed here, the uncertainty of point positions is mod- 
eled by defining for each point a small uncertainty region in which the point 
can lie, without knowing its exact position. This may be compared to the more 
general approach as advocated by Durrant- Whyte and others where one defines 
for each position a probability distribution [1]. These distributions are then used 
to derive new geometric distributions, e.g., for the parameters of a line passing 
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through two points. By contrast, instead of computing precise distributions, we 
only calculate in which region a distribution is non-zero. The advantage of this 
simplification is that we can go much further in the extraction of geometric infor- 
mation, without computing complicated probability measures. By comparison, 
also Lowe makes simplifying assumptions regarding the distribution of lines in 
an image and introduces an empirical significance measure for parallelism [5]. 
Lowe’s interesting view is inspired by concepts from perceptual organization, 
but it remains empirical, without a strict formal introduction of uncertainty. 

In euclidian geometry grouping into parallel classes would be trivial, as eu- 
clidian parallelism is an equivalence relation. In a digital image uncertainty on 
positions causes uncertainty on the slopes of lines, and parallelism is therefore no 
longer an equivalence relation. We must find partitions that “resemble” equiv- 
alence classes. In this work we explain how this problem can be reformulated 
as finding groups of lines that resemble cliques in the so-called graph of parallel 
pairs. This reformulation is possible because parallelism in the digital plane turns 
out to be a Helly-type property: a collection of lines is digitally parallel if each 
its pairs of lines is digitally parallel [8]. That is, the Helly-type property holds for 
the entire collection as soon as it holds for each of the collection’s n-membered 
subcollections, where n = 2. 

In this paper we show how a classification into parallel groups depends on two 
factors: first, a measure for the uncertainty on pixel positions, which we call the 
acceptable thickness, and second, an optimization criterion, which determines 
how much parallelism is required in a group of parallel lines. Thus, we may 
require that each group consists of lines that are all parallel to each other, or we 
may impose the weaker requirement that it suffices that each group contains at 
least one line to which all other lines in the group are parallel. 

In Section 2 we give a brief overview of the concepts of digital straightness, 
collinearity, parallelism, and concurrency as they are used in this work. Section 
3 introduces the so-called graph of parallel pairs, which represents all parallel 
relations between line segments. In Section 4 two different algorithms are pro- 
posed that use the graph of parallel pairs to extract large groups of parallel line 
segments. We give some concluding remarks in Section 5. 

2 Brief Overview of Geometric Properties 

In this section we briefly review results obtained in previous work [8,9]. Since 
image pixels lie on a rectangular grid, we assume that all pixels are part of the 
digital plane, although this is not strictly necessary for what follows. The digital 
plane 7Z^ consists of points that have integral coordinates. A digital set 5 is a 
subset of the digital plane. 

Several issues have influenced the development of mathematical discretiza- 
tion schemes. For computer generated images discretization schemes are used 
to obtain a digital representation of a euclidian object. For natural images they 
provide a simplified model of the complicated discretization process of digital 
image formation, i.e., a model for a CCD camera. Finally, for natural as well as 
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computer images, the discretization scheme can be used to examine the inverse 
process, i.e., given a digitized image How can we reconstruct the original image 
and the geometric structure contained within it? 

In this paper we are mostly interested in the last issue. During discretization 
the precise knowledge about position of geometric objects is lost. Or in other 
words, given the digital image we are uncertain about the position of objects 
in the original image. We shall model this uncertainty by an uncertainty region 
that we associate with each grid point. The discretization process that coincides 
naturally with this notion of uncertainty is the discretization by dilation scheme 
developed by Heijmans and Toet [2]. Let U denote a set in IR^, let H be a second 
subset of IR^, called the structuring element, and let Ap be the translate of A 
by p. Then the discretization by dilation of U consists of all points p G for 
which ApHU is non-empty [2,7]. 

Furthermore, to simplify the exposition, in this paper we shall restrict our- 
selves to one particular form of discretization by dilation, that is, we use a simple 
variant of grid-intersect discretization. To model the uncertainty of its position, 
for each digital point p = (x,y), we introduce as translate of the structuring 
element the vertical line segment Cp{r)^ which comprises all points (x, 6) G 
that satisfy y — r j2 < b < y ^ r j2. Here r is a positive real number, called 
the acceptable thickness. Up to a certain multiplication factor, this notion of 
acceptable thickness coincides with Reveilles’s notion of arithmetical thickness 
of a discrete straight line [6]. Fig. 1 shows several digital sets, their vertical 
segments, and straight lines that transverse these segments. To simplify what 
follows even further, we shall only discuss the extraction of straight lines of the 
form y = ax (5^ where the slope a satisfies —l<a<l. We assume that each 
set S contains at least two points with distinct x-coordinates. 

Although the foregoing restrictions do not change our results in a fundamen- 
tal way, they allow us to discard several special cases discussed more thoroughly 
in previous work [8], where we show, for example, that for lines whose slope lies 
between -1 and 1, vertical segments are sufficient to model position uncertainty. 
Also in previous work, other cells than vertical line segments have been consid- 
ered in a more general framework where we examine the modeling of uncertainty 
in the multitude of parameter spaces that arise during the hierarchical grouping 
of line segments [10]. 

Definition 1. A digital set S is called digitally straight if all segments Cp{r), 
p e S can be transversed by a common straight line. 

Note that the above definition of straightness is more general than the stan- 
dard definition of a digitally straight line, where one chooses r = 1, and where 
additional connectivity constraints must be satisfied. 

Definition 2. Let aS'i, . . . , a finite collection of finite digitally straight sets, 
and let Ai denote a euclidian straight line. We define the following digital geo- 
metric properties, illustrated in Fig. 1: 

— The sets Si are called digitally collinear if there exist n collinear line seg- 
ments Ai, . . . , An where each Ai transverses all the segments of Si] 
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— The sets in Si are called digitally parallel if there exist n parallel straight 
lines Ai, . . . , An where each Ai transverses all the segments of 

— The Si are called digitally concurrent if there exist n straight lines , An 

that meet in a common point and where each Ai transverses all the segments 
of 5,. 




Fig. 1. Si and S 2 are digitally collinear; Ss and ^4 are digitally parallel; ^ 4 , S^ 
and Sq are digitally concurrent 



The following result is crucial [ 8 ]: 

Theorem 1. Let S = {Si^ ^ Sn} cl finite collection of finite digitally straight 
sets. Then the collection S is digitally parallel if each subcollection containing 
two sets of S is digitally parallel. 

According to Theorem 1 all knowledge about parallelism can be derived from 
the parallelism of pairs. Although we focus in this paper mainly on the graph- 
theoretical algorithms, we give a brief overview of the geometric algorithms that 
are needed to determine whether a pair of lines is digitally parallel [9] . 

Definition 3. Let 5 be a finite digital set that contains at least two points with 
distinct x-coordinates, and let r be a chosen acceptable thickness with 0 < r. 
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Then the domain of 5, denoted as donia^(*S'; r), is the set of all parameter points 
(a, f3) G IR^ that satisfy the following system of inequalities: 

- Tj2 < axi + I3i - yi < Tj2, (xi,yi) e S. ( 1 ) 

In other words, the domain contains the parameters of all the euclidian lines 
that transverse the vertical segments of 5. In general, the domain of a large and 
sufficiently elongated set is small, since the parameters of the lines transversing 
all these segments can vary only within a small range. 

Consider for example the collection of digital sets representing lines in Fig. 2, 
whose domains are shown in Fig. 3, for an acceptable thickness r = 2. The long 
line segment O has a small domain, while the short segment K has a large 
domain. In fact, since each point of a set corresponds to two halfplanes in the 
parameter space, the domain of a large set is the intersection of a large number 
of halfplanes, and, in general, the domain will therefore be small. Furthermore, 
a domain of a digital set with N points is a convex bounded set, and can be 
computed in 0{N log N) time as an intersection of 2N halfplanes [9]. When 
we let T vary, a domain gets larger for increasing values of r. If the acceptable 
thickness is too small, however, the domain of a set will be empty. For given 
set 5, let D d IR be the set of acceptable thicknesses for which the domain of S 
is non-empty. The thickness of S is defined as the infimum of D [8]. 




Fig. 2. A collection of lines 



To reformulate the digital geometric concepts of Definition 2 in terms of do- 
mains, we need one more notion, i.e., the projection of a domain upon the a-axis 
in the (<a,/3) parameter plane. For a subset S of IR^, let 7TQ;(doma,(5; r)) denote 
the interval that results by projecting the domain of S upon the <a-axis. The 
interval 7Tc{domx{S;r)) has the following geometrical meaning: 7ro;(doma,(5; r)) 
is equal to the open interval where is the infimum of the slopes of the 
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Fig. 3. Domains of lines for the acceptable thickness r = 2 



euclidian lines that transverse the vertical segment associated with the points 
in 5, and where o? is the supremum. Fig. 4(a) shows the slope intervals that 
result by projecting the domains in Fig. 3 upon the a-axis. It follows almost im- 
mediately that two lines are digitally parallel if and only if their slope intervals 
have a non-empty intersection [8,9]. 




Fig. 4. (a) The intervals that result from the projection of the domains, and (b) 
the corresponding graph of parallel pairs 
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3 Graph of Parallel Pairs 

We shall represent parallelism by the graph shown in Fig. 4(b), called the graph 
of parallel pairs. In this graph two vertices are connected by an edge if the cor- 
responding lines are digitally parallel. Theorem 1 remains essential here, since it 
allows us to conclude that any complete subgraph (or clique) is digitally parallel, 
i.e. the cliques GJKL and GJKM in Fig. 4(b), . In other words, this conclusion 
is valid because parallelism in the digital plane is a Helly type property, that is, 
we can draw a conclusion for the entire set by examining all pairs, i.e, the edges 
of the graph. 

Before proceeding we must briefly review the definitions and properties of 
graphs that we need (see, e.g., [11]). Let G be a simple graph, i.e, a graph 
that contains no loops or double edges. An independent set is a set of vertices 
that does not contain any pair of adjacent vertices. A dominating set is a set 
of vertices such that each vertex in G is adjacent to at least one vertex in the 
dominating set. A vertex eover is a set of vertices such that each edge in G is 
incident with at least one vertex in the vertex cover. A eoloring assigns an index 
to each vertex so that two adjacent vertices always receive distinct indices. An 
optimal eoloring uses a minimum number of indices, i.e., the ehromatie number 
of the graph. A graph is perfeet if every induced subgraph has an independent 
set meeting all its maximum cliques, that is, cliques of maximal size. 

In the graph of parallel pairs, two vertices are joined by an edge when the 
slope intervals of the two lines intersect. Therefore, this graph falls into the cat- 
egory of interval graphs. Similarly, the eomplement of an interval graph belongs 
to the category of eomparahility graphs. Interval graphs as well as comparability 
graphs are perfect [11], which has important consequences from the computa- 
tional viewpoint. In general, finding an optimal coloring, a minimum dominating 
set, a minimum vertex covering, minimum clique covering, or maximum depen- 
dent set are NP-hard problems for graphs in general [3,11]. As soon as we have 
a perfect graph, however, each of these sets and coverings can found in polyno- 
mial time [3]. Interval graphs possess even stronger properties, and the above 
sturctures can often be found in linear time. 

4 Classification of Lines into Parallel Groups 

In euclidian geometry a graph of parallel pairs would consist of disjoint cliques. 
Since parallelism in the digital plane is not an equivalence relation we must 
look at techniques that extract subgraphs that are similar to cliques, and in this 
way approximate the equivalence relation of euclidian geometry. Fig. 5 shows an 
application, where long lines with slope between -1 and 1 have been extracted 
from the image of a building. Fig. 5(b) shows the graph of parallel pairs for 
T = 2. In this particular application we wanted to detect automatically some of 
the vanishing points in the image. The idea is to look for three or more lines 
that meet at a common point and whose slopes are almost equal. To reduce the 
number of possible combinations, one of the essential steps is to extract good 
candidate groups of lines that are almost parallel. 
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Fig. 5. (a) Lines detected in image of building with horizontal Sobel operator, 
(b) Graph of parallel pairs of longest lines for acceptable thickness r = 2 



We shall propose two algorithms that partition a graph of parallel pairs into 
subgraphs that resemble cliques. Clearly, the partitioning will depend on the 
amount of parallelism that is required within each group. The first algorithm 
looks for a minimal number of groups in which parallelism is maximal, that is, 
each line in the group must be parallel to each other group member. By contrast, 
we could look for groups with a minimal amount of parallelism, by extracting 
the connected components of the graph. The second algorithm, imposes a more 
natural requirement, it looks for groups in which there is at least one member 
to which all other lines are parallel. 



4.1 Classification Based on Minimum Clique Coverings 

Covering the graph of parallel pairs by a minimum number of cliques is equiv- 
alent to finding an optimal coloring of its complement which is a comparability 
graph. The algorithm that we propose here is a modification of an existing col- 
oring algorithm that uses the partial ordering induced by a comparability graph 
(see [11], page 201). We can use the fact, however, that the comparability graph 
considered here is the complement of an interval graph, and that the partial or- 
dering can be derived directly from the intervals. Each line S yields an interval 
7TQ,(doma,(5; r)) =]a^, bi[ of slopes. Let ]ui, [,..., the intervals be sorted 

according to their lowest boundary and assume that we have reassigned in- 
dices such that ai < aj whenever i < j. Note that the ordering of this sequence 
is not unique. Furthermore, the intervals induce an ordering on the complemen- 
tary graph. To be precise, we let Si < Sj whenever the slope intervals do not 
intersect and we have bi < aj. 
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Algorithm 1. Partitioning into parallel subsets (Equivalent to optimal eoloring 
of eomplement of interval graph). We start by selecting from the sorted sequence 
the interval Bi =]%-, hj [ for which hj takes a minimum value. To this interval and 
to all the intervals that intersect it we assign the first color, and we remove these 
colored intervals from the sequence. We then proceed iteratively by assigning 
the ith color to the interval Bi =]a/c, bk[ for which bk takes the minimum value 
in the remaining sequence and to all the intervals that intersect it. The algorithm 
yields a partitioning into a minimum number of cliques. □ 

In addition, after coloring the vertices, if we select one vertex of each color, 
then it is clear that we obtain a maximal independent set, that is, an indepen- 
dent set that cannot be extended further. That this independent set is also the 
largest possible follows from the Perfect Graph Theorem, according to which the 
maximal size of an independent set of a perfect graph is equal to the minimum 
number of colors needed to color its complement. 



Correctness of the Algorithm. The algorithm produces a valid coloring for 
the complement of the interval graph. In fact, we claim that the intervals are 
only assigned the same color when their common intersection is non-empty. After 
assigning m colors, let ]ak,bk[he an interval for which bk takes a minimum value 
in the remaining sequence. Then the real number bk must lie in all those intervals 
in the remaining sequence that intersect ]a/c, bk( since otherwise bk would not be 
a minimum. Thus, all intervals that are in the same coloring class have a common 
non-empty intersection. As a result they are all mutually incomparable, and 
therefore they form an independent set in the comparability graph. Furthermore, 
the coloring must be optimal, since if the algorithm uses m colors then there are 
at least m mutually disjoint intervals, or in other words, there is a clique of 
size m in the comparability graph. 



Example. Fig. 4(a) shows the slope intervals that correspond to the lines in 
Fig. 2. If we restrict ourselves to the lines of the largest component of Fig. 4(b), 
we find the following partitioning: {FT, F}, {O}, {A, AT}, {M, G, J}, {L}. This 
corresponds to a coloring of the complement of the graph of parallel pairs. Since 
we have a minimum number of color classes, the algorithm yields a partitioning 
into a minimum number of classes of parallel lines, or in other words a mini- 
mum clique partitioning of the interval graph. The lines {FT, O, A, M, L} form a 
maximum independent set. 

4.2 Classification Based on a Minimum Dominating Set 

A partitioning into a minimal number of parallel sets may not always yield the 
most natural result. Consider the example in Fig. 6. If we apply Algorithm 1, we 
find the following partitioning: {A,F}, {G, D,F}, {F}. The symmetry apparent 
in the graph of parallel pairs and the intervals does not occur in the grouping. 
In fact, in this example any minimum clique partitioning is asymmetric. We now 
propose a second technique that requests less parallelism in the groups, and often 
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leads to more natural results. It is a covering based on a minimum dominating 
set of the graph. Recall that a set of vertices is dominating if each vertex is 
adjacent to at least vertex in the dominating set. We shall group the lines such 
that each group contains one vertex of a dominating set and all its neighbors. 
Thus, in each group there will be a line to which all other lines in the group are 
parallel. Note that distinct groups may not be disjoint. 

Let S denote the set of intervals, and assume that the intervals ]ai, 6i[, . . ., 
]an,bn[ have been assigned indices according to an ordering on their left bound- 
aries, i.e., ai < aj whenever i < j. 



(a) 

AI 1 

B I 1 

Cl 1 

Dl 1 

El 1 

FI 1 



Fig. 6. (a) Slope intervals of a set of lines; (b) The corresponding graph of 
parallel pairs 



(b) 




Algorithm 2. Covering with parallel subsets (Equivalent to minimum domi- 
nating set for interval graphs). We first set the dominating set D equal to the 
empty set. Let Bj =]aj^bj[ be the interval in the sorted sequence for which bj 
takes a minimum value. We look in the sorted sequence for an interval ]ai,bi[ 
that intersects ]aj, bj [ and for which bi takes a maximum value. We add ]a^, to 
the dominating set T), and we remove and all the intervals that intersect 

it from the sorted sequence. We proceed iteratively by taking the next interval 
]a/c, 6/c[ in the remaining sequence for which bk takes a minimum value. □ 

Correctness of the Algorithm. Clearly, the resulting set T) is a dominating 
set, since it includes for each interval at least one interval that intersects it. 
Furthermore, the dominating set has minimum size, which can be seen as follows. 
Let D = {]pi, ^i[, ]p2, ^2[, • • •)} t>e the dominating set found by the algorithm, 
and let ]aj,bj [ be the interval for which bj is a minimum in the starting sequence. 

First, for the interval ]aj,bj[ there must be at least one interval in any dom- 
inating set that intersects it. Let be any minimum dominating set, and let 
]a/c,6/c[ be an interval in E^ that intersects ]aj^bj[. Then we claim that the 
set E^ = {E^\]ak,bk[)U]pi,qi[ is also a minimum dominating set. It suffices to 
prove that any interval that intersects ]a/c, bk( also intersects ]pi, qi[. Let ]a^, bm[ 
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be an interval that intersects Suppose that ]a^, 6^ [ also intersects ]aj,bj [. 

Then, ]pi, qi[ also intersects ]a^, 6^[, since bj is common to all the intervals that 
intersect otherwise bj would not be a minimum value. Suppose 

does not intersect then bj < It follows that must intersect 

]pi,qi[, because among the set of all intervals ]ai,bi[ that intersect ^j[, which 
includes ]a/c, 6/c[, we have bi < qi. 

Next, it is clear that the set E^\]pi,qi[ must be a minimum dominating set 
for the set formed by all intervals in S that do not intersect because 

otherwise itself would not have minimum size. Thus, by the same reasoning 
as above, we can prove that there exists a minimum dominating set E^ that 
contains ]p 2 , Q 2 [, and therefore there must a minimum dominating set containing 
both ]pi,<7i[ and ]p 2 ,<? 2 [- By proceeding iteratively, it follows that D is indeed a 
dominating set of minimal size. 

If we apply this algorithm to the intervals of Fig. 4(a), then we find {E, K} 
as a minimum dominating set. This dominating set yields the following group- 
ing: {T, O, A/', FT} and {AT, T, M, J, G, A/'}. Note that the line N belongs to both 
groups, as it is parallel to E as well as K. Although this grouping is not a parti- 
tioning, it leads to a more natural result than the minimum clique partitioning 
algorithm. 

One may expect that the dominating set method generally produces fewer 
groups than a minimum clique partition. This is indeed the case, as the following 
properties hold for any graph: (i) An independent set is a dominating set if and 
only if it is maximal; Thus, the size of minimum dominating set cannot be 
larger than the size of a maximum independent set; (ii) Any vertex cover is a 
dominating set; Thus, the size of minimum dominating set cannot be larger than 
the size of a minimum vertex cover; (iii) The complement of an independent set 
is a vertex cover. Hence the following relation follows: 

7 ^ (minimum dominating set) < 

min ( 7 ^ (maximum independent set), \S\ — 7 ^(maximum independent set)) . 

For example, for the complete graph ATn, the size of a maximum independent 
set is 1, and a minimum vertex cover has size n — 1. For the star shaped graph, 
defined as the graph with n vertices where there are only edges between one 
central vertex and each other vertex, the size of the maximum independent set 
is n — 1, and the size of a minimum vertex cover is 1. In both cases, however, 
the size of a minimum dominating set is 1. 

5 Concluding Remarks 

In future work, we will examine the application of the proposed algorithms to 
discretization by dilation schemes that use structuring elements that are more 
general than the vertical line segments used here. This extension seems to be 
rather straightforward, the major consequence being that the computation of 
the domains of the line segments becomes more intricate. The extension to other 
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more recent digitization schemes such as the topology preserving scheme pro- 
posed by Latecki et ah [4], and the Hausdorff discretization of Ronse and Tajine 
seems less obvious, however [7]. In the first place both schemes have been devel- 
oped to have certain desirable properties such as preservation of topology and 
reconstructability of the original object (when the grid resolution is increased). 
To capture the notion of position or line slope uncertainty was not the primary 
goal of these two schemes. 

Furthermore, we are also looking how parallel line grouping can be integrated 
into a larger framework that also involves collinearity, concurrency and proximity 
of lines. In this regard, we expect that graph theory will remain inport ant. In 
fact, whatever the way uncertainty in a digital image is represented, it is rather 
natural to represent the uncertainty of line slopes by slope intervals, which has 
led us to interval graphs. We concluded that algorithms that perform parallel line 
grouping can do this by extracting groups of vertices that resemble cliques, and 
we discussed two possibilities: (i) extract the cliques themselves, or (ii) extract 
neighborhoods of dominating sets. Both techniques have been well examined in 
graph theory, and we showed how they provide a formal way to extract parallel 
groups of lines. 
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